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FOURIER TRANSFORM REPRESENTATION OF AN 
IDEAL LENS IN COHERENT OPTICAL SYSTEMS 

Gerald J. Grebowsky 
ABSTRACT 

This document presents a mathematical analysis of the approximations re- 
quired to obtain the Fourier transform representation of an ideal lens. An at- 
tempt is made throughout the paper to demonstrate the physical significance of 
the approximations and the variations from ideal results produced by neglected 
terms in the mathematical formulation. The approximations involved are con- 
sidered in terms of the output signals in optical spectrum analyzer, optical im- 
aging, and optical correlator systems. 
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FOURIER TRANSFORM REPRESENTATION OF AN 
IDEAL LENS IN COHERENT OPTICAL SYSTEMS 


INTRODUCTION 

In recent years there has been a growing interest in the application of optical imaging tech- 
niques for the purpose of processing data signals. These efforts are largely based on the interpre- 
tation of optical imaging systems as spatial filters 1 . By introducing Fourier transform methods 
the relation between an object and its image has the same form as the relation between the input 
and output signals of an electrical system. Comparing the transform of the object and image, the 
imaging process (unity magnification is assumed here) can be described by the expression: 

1 (•*•«• ''y) ' T ('V u y ) °('V "y ) 

Image spectrum = Transfer function x Object spectrum 

This expression has the same form as that for an electrical network except that the spectrums are 
two-dimensional. Since optical objects and images are two-dimensional, a Fourier transform must 
be taken with respect to each of the two spatial coordinates instead of the single time coordinate 
which appears in electrical signals. 

It is the transfer function relation given above which leads to the spatial filter interpretation of 
optical imaging systems. The optical transfer function T(co x) oj y ) is a characteristic of the optical 
elements in a system. An ideal imaging system would have a transfer function T(oo x , co y ) which is 
constant over the object frequency range. In such an ideal system the image would be an exact 
replica of the object. The corresponding electrical system would have a flat frequency response 
over the bandwidth of the input signal. 

It may appear at first thought that the transfer function notation is nothing more than an arbi- 
trary selection of notation. However, in optical imaging systems, an object represented by a si- 
nusoidal variation of light amplitude is imaged as a sinusoidal variation even in the presence of 
aberrations 2 . Aberration effects result in reduced contrast and a lateral shift of the sinusoidal 
image. Thus using sinusoidal test gratings it is possible, in theory, to experimentally determine 
the transfer function for a given optical system. In general, the optical transfer function can have 
complex values. The magnitude is related to the reduction in contrast, and the phase is related to 
the lateral shift of the image. In an actual system the transfer function will not have constant ampli- 
tude and phase as for the ideal imaging system described above. 

Since the optical transfer function is determined by comparing the output image to the original 
object input, introducing any additional element into the optical system to vary the amplitude and/or 
phase transmission properties will change the optical transfer function. To utilize an optical system 
as a spatial filter in a fairly direct manner, it is necessary to know what amplitude and phase vari- 
ations should be inserted and where they should be inserted. Otherwise, obtaining a particular 
transfer function for a spatial filter application would be a trial and error proposition. This im- 
plementation problem is solved for many cases of practical importance by the optical Fourier trans- 
form representation which is discussed in this report. 


Within certain limitations the light amplitude distribution in the back focal plane of a lease is 
proportional to the two-dimensional Fourier transform of the light amplitude distribution of a two- 
dimensional object inserted on the front side of the lens. Within the range of validity for this opti- 
cal Fourier transform representation, the transfer function is varied by a multiplicative factor 
represented by the amplitude and phase transmission properties of an element inserted into the 
back focal plane of the lens. For example, to set the transfer function equal to zero for a particu- 
lar frequency component, the light passing through the corresponding point in the back focal plane of 
the lens is simply blocked. * 

The mathematical development of the optical Fourier transform representation presented in 
this report is intended to clarify the limitations and interpretation of the Fourier transform opera- 
tion of a lens. The derivation is based on the Rayleigh-Sommerfeld diffraction formula and optical 
paths defined by geometrical ray tracing. As each limitation is introduced, an attempt is made to 
describe the effects on the accuracy of the optical Fourier transform representation. Such detailed 
consideration has been found lacking in available treatments 3 of the derivation and is the main pur- 
pose for the development presented in this report. 

FOCAL PROPERTIES OF A LENS 

To derive the formula for a focussed diffraction pattern, we will define the properties of an 
ideal lens. We will restrict our discussion to the case of an ideal lens and ignore the effects of 
lens aberrations, and diffraction at the edge of the lens. We assume that these effects can be taken 
into account by modifying our end result or by restricting the range of variables to a region in which 
our ideal assumptions are valid within experimental accuracies. 

Our definition of an ideal lens will be based on the geometrical focussing properties shown in 
Figure 1. The properties assumed can be stated as follows: 

1. The lens can be represented by a plane L perpendicular to the optical axis and all refrac- 
tion takes place at this plane. This is the thin lens approximation which neglects the thick- 
ness of the lens. 

2. The rays passing through the point o (intersection of the optical axis and the lens plane L) 
are called principal rays and will not be deviated. 

3. All incident rays parallel to a principal ray will be focussed to the point at which the prin- 
cipal ray intersects the back focal plane F' . That is, the light reaching a point in the 
back focal plane F' at a distance p = f tan0 (f is the focal length- distance between the 
planes L and F ) from the optical axis is contributed by a principal ray making an angle 
0 with the optical axis plus all rays parallel to this principal ray. 

4. If we construct a plane P perpendicular to a bundle of parallel incident rays, the optical 
path length will be the same along any of the parallel ray paths from p to the common 

point of focus in F ' . * 

For any set of parallel rays, Figure 1, represents the projection of the parallel rays onto 
the plane through the optical axis and the principal ray as shown in Figure 2. In Figure 1 the 
distance p and p, are measured from the optical axis in the plane of the figure. As shown in 
Figure 2, these distances p and p 1 represent different quantities in the planes F and F' 
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respectively. The distance p in the back focal plane F ' is the distance from the optical axis 
to a point (x, y). In the plane F, p, is not the distance from the optical axis to a point (x, , y, ) ; 
p, is the projection of this distance onto the axis defined by the intersection of the plane F with 
the plane through the optical axis and a point (x, y) in the plane F' . Since the orientation of the 
p, axis will depend on the point (x, y) being considered in F',p, will be a function of x, y, x, , 
and y, whereas p depends only on x and y . The difference in meaning of p and p t also appears 
when the algebraic sign is considered. In Figures 1 and 2, p is the distance from the optical axis 
to the point of focus and is a positive quantity regardless of where the point is located. On the other 
hand p, is a coordinate of the intersection of a ray with the F plane and we will use the sign con- 
vention that p, is positive above the axis and negative below the axis when drawn as in Figure 1 
(rays sloping down to the right). In reference to a point (x, y) in F' the positive p { axis will lie 
in the quadrant opposite to that of the point (x, y) . 




Figure 3— Geometry forp ond p,. 

Figure 3 shows the geometry of the various lines in planes F and F ' and demonstrates the sign 
convention for p y . From the geometry of Figure 3 A we find the relations: 
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p - (x 2 + y 2 ) 1//2 
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From the geometry of Figure 3B we can determine p, lor any point (x, , y,) as follows: 
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f 




t , * r cos .* - r cos (,' - ») - r {cos / cos ' 4 sin / sin »} 

or f> x “ x, cos / - y, sin / 

Substituting the relations for cos 7 and sin 7 obtained from 3A we can rewrite the expression for 
p, as 



t 

| 


“XXj-yy, -xx,-yy, 



The dependence of p on x and y and of p, on x, y. x, , and y, is explicit in equations (1) and (2) respec- 
tively. These results agree with our discussion in the last paragraph. 



I We will now use Figure 4 to determine an expression for the optical path length from a point 

| ( 5 p y,) in F to a point (x, y) in F . Figure 4 shows a principal ray AA and a representative 

I parallel ray (dotted). The plane P is perpendicular to the incident rays. Since we are neglecting 




the thickness of the lens in our discussion, the optical path length r for the principal ray AA will 
be given by the geometrical length: 
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i * i * 
l 2 3 


( 3 ) 


From the geometry of Figure 4, l is given by the expression 


(f 2 2 )‘ 


( 4 ) 


By similar triangles, we find that l . is given by 
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( 5 ) 


For the principal ray AA the optical path length from the plane P to l*&e point A in the back focal 
plane F is L x + l 2 . Using equations(4) and (5) this length is given ae 


t, + 1 2 


df 

+ j~ 
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if + df 


f2 +^2 + df 


'I ' (f 2 + / 2 )« 2 


By the fourth of our assumptions for an ideal lens the optical path length from the plane P to 
the focus point A is the same for all the rays parallel to the principal ray A A (note that£, +l 2 
does not depend on p, ). Therefore the expression found for £, +l 2 holds for every parallel ray 
and the expression for the optical path length r can be written as 


*• * + * a + * 3 


f 2 +p 2 + df 


(f 2 + p 2 )> /2 




( 6 ) 


The term l 3 remains to be determined and by comparison of the principal ray and the representa- 
tive ray in Figure 4 it should be obvious that this term will not be the same for all parallel rays. 
From the right triangle with£ 3 as a leg,£ 3 can be expressed as 


-e. 


pj sin# 
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and the expression for can be rewritten as 


Ho wo ver, *■/ ^ ■ 

(p+p *) 12 


i (| i! t , 2) l 2 (7) 

Substituting for l 3 in the expression for the optical path length r we obtain 


We have previously derived expressions for p and p i as given by equations (1) and (2), Substitu- 
ting for p and p { in equation (8) we obtain 

f 2 i x 2 4 y 2 + elf ~ xx J - yy J 

r (f 2 H X 2 + y 2)l '2 (9) 


This expression gives the oi tical path length from any point (x , , y , ) in a plane F (a distance cl in 
front of the lens) to a point (x, y) in the back focal plane F . To facilitate further discussion we 
will write this expression as 


r = R(x. y) - ax ( - By , 


( 10 ) 


where: 


R(x, y) 


f 2 + df + x 2 + y 2 
(f 2 + x 2 +y 2 ) 1/2 


(f 2 +V 2 ) ,/ « 


ft 




7 ' ' 

(f 2 +x 2 + y 2 ) 1,/ 2 


To summarize our results at this point, we can consider a point A in the plane F as shown in 
Figure 5. Assuming that light is radiated in all directions from the point A, we will consider the 
portion of light propagated in directions at an angle 0 with respect to the normal n to the $ lane F . 
The light rays representing these directions will form the surface of a cone of half angle 6 as 
shown in Figure 5. For each of these rays a parallel principal ray can be drawn and each principal 
ray will make an angle 0 with the optical axis. Thus for each ray at an angle 0 with respect to the 
normal to F at the point A, we can apply the results derived above. That is, each ray is focussed 
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to a point on the ring of radius p ~ f tanfl where the corresponding principal ray intersects the 
back focal plane F . The optical path length from A to each point on the ring is given by equation 
(10). Since this holds for any point A, we can state the general focal properties of our ideal lens 
as: 

1. The light radiated from all points on F in directions at an angle 0 with respect to the nor- 
mal to F is focussed into a ring of radius p = f tan 0 in the back focal plane P* . 

2. The optical path length r from any point (x, , y, ) on the plane F to any point (x, y) in the 
back focal plane f is given by equation (10). 

In these statements the plane F is a plane perpendicular to the optical axis at a distance d in front 
of the lens. 



Anticipating the derivations in the next section, the relation between p and e specified by the 
first focal property above can also be expressed in terms of cos0 as 


cos 9 


f 

(f2 + p 2)l/2 


f 

(f 2 +X 2 +y2)l/2 
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This expression can be derived from the geometry of the figures in this section or can be derived 
from the equation in terms of tun# by applying trigunometric identities. 



FOCUSSED DIFFRACTION PATTERN 


Since we will consider only light distributions on plane surfaces, we can use the Rayleigh- 
Sommerfeld diffraction formula (see Appendix). In rectangular coordinates this diffraction for- 
mula has the form: 



( 12 ) 


Figure 6— Relation between points (x, y, z) and (x,, y,) in diffraction formula. 

This formula gives the complex light amplitude A(x,y,z) at any point in space (z > 0) due to a 
monochromatic coherent light distribution A (x 15 y x ) given for every point (xj, y t ) in a plane F . 
Referring to Figure 6, the terms in the diffraction formula are defined as follows: 

i 

1. A (Xj , y t ) is the complex amplitude of monochromatic light given for all points (x 1; y,) 

in a plane F located at ■ = 0 „ 

2. A(x, y, z) is the complex amplitude of light produced by A (x 1# yj ) at a point (x, y, z) in 
space ( z > 0) . 
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3. r is the distance from a point (x, , y, ) in plane F to the point (x, y, z) 

4. o is the angle between r and n where r is directed from (x. , y, ) to (x, y, z) and n is the 

normal to the plane F at (x, , y, ) in the direction of the positive z axis. The term cos 0 

is usually referred to as the obliquity factor. 

5. k = where X is the wavelength of the monochromatic light. 


In general each of the three vector components of the electromagnetic field representing the light 
distribution must be determined by the diffraction formula. For our discussion we will consider 
the light amplitude distribution as a scalar which requires only one equation^, in practice this is 
permissible if polarization affects can be neglected. Thus we will define the light amplitude such 
that the square of its absolute magnitude gives the intensity which is a measurable quantity. 

We can simplify the diffraction formula immediately by considering the relative magnitudes of 
the terms inside the brackets: 



For wavelengths X as long as 100 microns (far infrared) the first term is relatively large (600) 
while for r larger than 1cm, the second term is less than one. For visible light X is much less 
than 100 microns (0.4 to 0.7 microns) and k is of the order of 10 5 . Since our discussion (as in 
most cases in optics) will deal only with r greater than one centimeter, the -■ term is negligible 
and can be dropped without any appreciable effect on accuracy. The diffraction formula equation 
(12), can therefore be written as; 


A(x, y, z) 


4 


, ik r 


a' ( x i* y i) ~~r cosd dx i d Vi 


(13) 


where the constant factor ik has been taken outside the integral. 

The obliquity factor cos 0 is a weighting factor which accounts for the difference in the amount 
of light radiated in different directions. Since cos 6 has a maximum value of one at e equal to 
zero, this factor has a maximum value of one for light contributions propagated normal to the signal 
plane and drops off as the angle with respect to the surface normal increases. Referring to Figure 
6, if we assume the light from a point (Xj, y t ) contributing to the light at the point (x, y,z) travels 
the straight line r f this line is a light ray at an angle d to the normal n . In the previous section, 
we showed that through the focal property of an ideal lens this angle is a constant for all light con- 
tributing to a point (x,y) in the back focal plane and that x, y and 6 are related by the expression 

f 

(f 2 + x 2 + y 2 ) 1/2 


cos & 


(ID 


In other words, an ideal lens focusses light of constant obliquity factor into a ring of radius 
(x 2 + y 2 ) 1 2 specified for a given t> by the above expression. 

F PLANE F PLANE 
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| 7 B. Focussed diffraction | | 

Figure 7— Comparison of diffraction wbh and without focussing. 

The significance of this focal effect can be seen by comparing the two diagrams in Figure 7. 
In 7A, the points A and B are sample points in the F plane and the point c and D are sample 
points in a parallel plane at z = d + f . If we consider the point c , we note that the paths AC and 
bc have obliquity factors of cos e and cos 9 2 respectively. From this example it is obvious that 
for a point such as c (or d) the obliquity factor will depend on the location of the contributing 
point (x,, y,) in F. Likewise if we consider the point A in f, we can note that the paths AC and 
AD have obliquity factors of cos 9. and 8 respectively. This indicates that the obliquity factor 
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also depends on the location of the point (x, y, z) . Since determining the obliquity factor is in- 
cluded in the derivation of the diffraction formula, we will give it here without proof for z d * f as 
shown in 7 A: 


d + f 

cos t:> ~ 7 -V. 

[(*“ xj 2 + (y - yj 2 + (d+ f) 2 J (14) 

This expression includes the coordinates of both the point (x, , y,) in the source plane and the point 
(x, y, z) at which the diffracted light amplitude is to be found. The expression for the general case 
will have a z in place of (d + f) which was used for the special case of Figure 7A. In the diffrac- 
tion formula the obliquity factor appears under the integral sign since x, and y, are the variables 
of integration and these terms appear in the obliquity factor as given by equation (14). 

Let us now consider the case of focussed diffraction as illustrated in 7B. In 7B only the rays 
AC and BD of 7A are considered and as indicated by the angle 6 { , AC and BD are parallel rays. 

The dotted portion of these rays indicates the path of light followed in 7A. Due to refraction by the 
lens these paths are changed to those focussed to the point E . Now when we consider a point such as 
E we find that the obliquity factor cos0, is the same for points A and B and therefore independent 
of the coordinates (x, , yj) of the point in F . If we consider any other point G in F we recall that 
to contribute to a point G a ray must be parallel to the principal ray OG, Rays parallel to OG will 
have an obliquity factor cos0 different from cos0, for the point E . Thus the obliquity factor 
does depend on the location of the point (x, y) in the back focal plane. The obliquity factor for the 

r> 

case of a focussed diffraction pattern is given by equation (11) as cos 0 = — — and does 

not depend on the coordinates x, and y t . ( f2 + x2 + V 2 ) 

Since the obliquity factor for the focussed diffraction pattern is independent of the integration 
variables \ l and y, , this factor can be taken outside the integral and we can write equation (13) 
as 


A(x, y) 



(15) 


where A(x, y) now represents the complex light amplitude at a point (x, y) in the back focal plane 
of a lens. 


To complete our discussion we will now consider the term r which was defined as the distance 
froin the contributing point to the point of interest. In Figure 6 this distance r is measured along 
the straight line from (x 1# y,) to (x, y, z) . In Figure 7B, the light traveling from A to E does not 
follow a straight line due to refraction at the lens plane L . We can assume that the effects due to 
the length of the refracted path are the same as traveling an equivalent distance in a straight line, 
and the r in the diffraction formula can be interpreted as the optical path length which we determined 
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in the previous section. Thus e ikr represents the change in phase over an optical length r , and 
- is an attenuation factor which decreases the amplitude contribution as the optical path length 
increases. 

Substituting the optical path length expression for r , as given by equations (9) and (10) the 
equation (15) becomes 


-i f ff 

A(x ' V) ‘ r(f’+x» + y’)‘--*JJ r f u , 


A' (x,, y , ) c 


ikp(x,y)-M f/’y j] 


x 2 + y 2 -I df - XX , - yy j" 
( f 2 + x 2 •! y 2 ) 1 2 j 


dx, dy , 


(16) 


J /2 

The terms (f 2 + x 2 + y 2 ) cancel and e ikR (x ‘ y) can be taken outside the integral to give 


-if I I A' (x , y ) c 

A(x, y) = — 11 ' 


II < 


->2W [a x iA+/3y, A] 


2 +x 2 +y 2 +df-xx, - yy, 


dx, dy , 


(17) 


We can now introduce the new variables p and q defined as 


a 

P = K 


\(f 2 +X 2 + y 2 ) 1/2 


(18a) 


.. P 
9 " X 


\ ( f 2 + X 2 + y 2 ) l/ " 2 


(18b) 


and factor f 2 + df from the denominator of the integral of equation (17) to obtain: 


A(*. y) " \(f + d) e‘ kR < x ’ y) JJ 


x (x- x,) + y(y-y t ) 
1 + f(f + d) 


dx, dy, 


(19) 


FOURIER TRANSFORM APPROXIMATION 
We can now use the algebraic identity: 


l + 


M 

N 


1 + 


N 

M 


13 


l 





to obtain: 


x(x- x t ) + y(y _ y , ) 
f(f + d) 


f(f +d) 

x(x-x,) + y(y-y,) 


Introducing this identity we can write equation (19) as 


A ( x - v) ” \(f + d) 


e ikR(x,y) 


Ilk 


: (f + d) 


x(x- Xj) + y(y~ y,) 


A' ( Xj , y j ) e 12 # 5<1 l,y ^ dxj dy 1 


We can rewrite equation (20) with an integral for each term in the bracket which gives 

A(x, y) = \(7Td) e ikR C x -y) JJa' (x,, y,) e i27r ( px i +qy 0 d Xj dy, 


TTd) eikR(X,y) Jj ~ f(f Td)- dx t dy 1 

1 x ( x-x i) + y(y-y l ) 


By restricting the maximum values (aperture limits) of x, y, x 1 , y, , the second integral of equation 
(21) can be made negligible compared to the first since the denominator of the integrand can be made 
large. This approximation will be discussed in more detail later; here we will simply assume that 
it is possible to neglect the second integral. 

The diffraction formula can then be written approximately as: 

_ j e ikR(X,y) 

A ( x - y) = \(f + d> f <p- £ >) (22) 

where F(p,q) is the two-dimensional Fourier transform of A'fXj.yj) • 


>. <0 = JJ A ' (Xj, y,) e ,277 ( px . +qy O dXj dy t 


t 


If we were to measure this light distribution, we would measure the intensity which is the square 
of the magnitude of the complex amplitude A(x,y) . This intensity is given as: 


i(x, y) 


A(x, y) A* (x, y) 


[\(f + d)]2 


F(p, q) F* (p, q) 


or I(x, y) 


I F(p, q ) 1 2 

[Hf + d )] 2 


(23) 


Thus the intensity in the back focal plane of a lens (within the limits to be determined for the ap- 
proximation made) is given by the square of the magnitude of the Fourier transform of the light ampli- 
tude in the plane F . 


If our aperture restrictions in the back focal plane limit the maximum values of x and y so that 
the phase variations due to the exponential term in front of the integral in equation (22) can be con- 
sidered constant, we can write 


A(x, y) ~ KF(p, q) (24) 

where K is a complex constant given by 

-i e ikR(x,y) 

K r T(f + d ) 

Thus within the range of x and y for which e ikR(x,y) can be assumed constant (i.e. negligible 
phase variation) the amplitude distribution in the back focal plane is proportional to the Fourier 
transform of the light amplitude distribution in the plane F . This relationship requires tighter re- 
strictions on x and y than our previous approximation. In terms of spectrum analysis in the back 
focal plane, this relation is not important since only intensity can be measured. However, in cas- 
caded lens systems, the Fourier transform relation between amplitudes allows each pair of lenses 
to be accounted for by a double Fourier transform operation. The advantages of such a relation will 
be demonstrated in a later section. 

FOURIER COMPONENTS ( P , q) AND FOCAL PLANE COORDINATES (x, y) 

In our previous discussions we have expressed the amplitude distribution A (x,y) in terms of 
the Fourier transform of A' (x,, yj) (refer to equation (22)). However, the Fourier transform co- 
ordinates are p and q which we defined by equations (18) as 


p - 


\(f 2 + x 2 + y 2 ) 1/2 


and q 


\(f 2 +x 2 + y 2 ) 1/2 


(18) 


Substituting for p and q in equation (22) we can obtain an expression for a (x,y) in terms of x 
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and y ; however, this result is somewhat complicated by the fact that p and q are each dependent 
on both x and y . When the above expressions are substituted for p and q , the optical Fourier 
transform is given by 


F(p ‘ q) F (a ( f 2 -l x 2 + y ' 2 ) 1 2 ’ K (f 2 + x 2 + y 2 ) 1 2 ) 

.. ff., / v -i2«[xx A/f 2 + x 2 +y 2 )' 2 +yy /A (f 2 + x 2 + y 2 ) 1 2 ] 

- JjA'(x liyi )e ( ‘V&i 


(25) 


I 


A more desirable relationship would exist if p were directly proportional to x and if q were directly 
proportional to y . Then the light amplitude at a particular value of x would be related to a particular 
value of p and a similar relation would exist between q and y . As given by equation (18) p and x 
(also q and y ) are not so simply related since P also depends on y ( q also depends on x ). In a 
later discussion the importance of a linear relation between the transform coordinates p,q and the 
spatial coordinates x, y will be shown. 

To obtain a linear relation between p and x let us consider the series expansion of equation (18a). 


x 2 + y 2 3 /x 2 + y 2 \ 2 
~2f ~ + 8 \ f 2 ) 


If we restrict our analysis to an area of the focal plane such that 



x 2 + y 2 

2f 2 


« 


1 


we can neglect all but the first term in the brackets to obtain the approximation 


X 

p " \f 


(26) 


Similarly, we can obtain an approximation of q given by 


9 c XT 


(27) 
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Thus within a restricted area of the back focal plane the Fourier transform expression can be 
written as 


f(p. q) ' F (rr ivr) JJ A ' ( x i * y i) 


-I2w(> 


A 


f+yy ,Af 


dx, dy, 


(28) 


The coordinates x and y in the back focal plane are then scaled representations of the frequencies 
p and q respectively. That is, light contributions corresponding to a spatial frequency p in the 
*i direction appear at the coordinate x = Xfp in the back focal plane (similarly, contributions of 
spatial frequency q in y f direction appear at y - Xfq ). 


The actual restriction to be imposed on x and y for the above approximation will depend on 
how accurate a Fourier frequency value is required in a particular application. The error in the 
approximate frequency of equation (26) and (27) as a fraction of the exact value given by equations 
(18) is 


E 


f 



(29) 


We can let r 2 - x 2 + y 2 (r is the radius of a circle in the x, y plane) and express r in terms of mul- 
tiples of the focal length f as given by 


r ~ af 


(30) 


Substituting r 2 = x 2 + y 2 = a 2 f 2 in equation (29) we obtain 


E f ~ (l + a 2 ) 1/2 - 1 (31) 

The curve of Figure 8 gives the percent error (100 E f ) of the linear approximation of frequency 
as a function of a . For a less than 0.14 the error will be less than 1%. Thus the linear 
approximations 


X 

(26) 

JL 

(27) 


are accurate within 1% for values of x and y satisfying the restriction 

(x 2 +y 2 ) 1/2 ~ r $ 0. 14f 
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For accuracies better than 1% smaller values of n must be imposed as given by equation (31) and 
Figure 8. 



Figure 8— Percent error in linear frequency approximation. 

Since our approximation requires that we limit our consideration to the area within a circle of 
radius r max =0,l4f (for accuracy within 1%), the maximum value of x 2 + y 2 is specified by: 


( 


x 1 ty = ) 


max 


- . 02f 2 


Squaring the approximate expressions for p and q 


[equation (26) and (27)] and adding we obtain 


p 2 4. q 2 


x 2 + y 2 
f 2 \ 2 


(32) 
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Applying the restrictions on x 2 + y 2 to equation (32) we obtain 


The maximum allowed value of p occurs when q - 0 and the maximum q occurs when p = o : 

, 14 

p m «x x - for ' 0 ( Le ' y °) 

, 14 

q milx X - ,or p 0 (i.e. x - 0; 

As an example, consider green light of wavelength X= 5461 X I0' 8 cm . In this case the simplified 
expressions for p and q are accurate within 1% for frequencies in the range given by: 

(p 2 + q 2 ) 12 “ - 7 — " — pgi — « 2560 cycles/cm. 

On the x axis (y = 0, q = 0) the maximum frequency will be 


Pinnx = 2560 cycles/cm. 

On the y axis (x = 0, p = 0) the maximum frequency is likewise 


% ax ~ 2560 c y c l e / c m. 


In practice the limitation of available techniques for controlling the input light distribution 
A'fxj, y,) restrict maximum spatial frequencies to values below the 2560 cycles/ centimeter re- 
striction we have imposed above. Therefore, the linear approximations of the frequency compo- 
nents p and q [equation (26) and (27)] are applicable for practical systems and equation (22) and 
(23) can be expressed as: 


A(x, y) 


i(x, y) 


= K r(xf 

, JL 

jJjL, 


| F Uf 

Kf) 


[\ ( f+d)p 


(32) 


(33) 
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The accuracy of p and q used in equations (32) and (33) a c given by (18) is determined by equa- 
tion (31) as shown in Figure 8 for values of a =j . Thus for a given focal length f , the restriction 
on the maximum value of r determines the accuracy of the linear approximation introduced here. 

Of course, these equations also include the approximation assumed earlier in neglecting terms other 
than the F(p,q) term. In the next section we will consider that approximation and we will determine 
if the restriction x 2 + y 2 ' 0,02f 2 is also a sufficient restriction to assure the validity of neglecting 
terms other than F (p,q) 


APPROXIMATION LIMITS FOR THE FOURIER TRANSFORM REPRESENTATION 


Wo will now return to the focused diffraction formula given by equation (20) as 


A(x, y) 


_ ie ikR(x,y) ff* 

"Mf +d ) JJ 1 


1 

14 sm ) 

x ( x " x i) + y(y-vtl 



2W(px J +qy 1 


dXj dy t 


(20) 


and consider the limitations required to obtain the Fourier transform approximation given by equa- 
tion (22). To obtain the form of a Fourier transform of a' (x, , y, ) , the bracketed term must be 
approximated by a constant. This term can be assumed equal to one if we restrict the range of 
x, y,Xj, and y 5 to satisfy the inequality 


~ terra) - 

x 2 +y 2 - (xxj +yy t ) 


« 1 


Referring back to equation (21) this approximation corresponds to making the second integral neg- 
ligible 'ompared io the first integral which has the form of a Fourier transform The complete 
term inside the brackets of equation (20) is effectively a weighting factor which varies the contri- 
bution from each point) (x, , y^ to the point (x, y) . This factor represents the effect of the 
obliquity factor and path length attenuation. It is usually assumed that these effects are negligible 
and that the inequality condition is satisfied. In the following analysis we will attempt to present a 
more detailed quantitative discussion of this approximation. 


Neglecting the variable term when it satisfies the inequality condition given above is an approxi- 
mation of the light amplitude contribution from each point (x, , y, ) to a point (x, y) . That is, the 
contribution dA(x, y) at a point (>;, y) from an infinitesimal region dx t dy r about the point (x , y ) 
is given exactly by 


dA(x, y) = Kj 


1 - 


1 + 


f ( f + d) A ' ( x i> yj e ,277 ( px i + qy i) dx t dy t 


x ( x ~ x i) + y(yyiU 
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and applying the approximation of neglecting the variable term inside the bracket we obtain 


dA(x, y) ■* KA' (x r y,) e ,2 ”( px i +l,y i) dx, d yj ( 34 ) 

The total light amplitude A(x, y) at a point (x, y) is obtained by integrating over the range of x, 
and y, . The integration of equation (34) will yield an approximation for the total light amplitude 
A(x, y) at least as accurate as the worse case of equation (34). That is, the greatest possible 
error would be given by the maximum value of the neglected term. 

Thus to determine the limitations to be imposed, we will consider the maximum ♦ rror intro- 
duced by neglecting the variable term to obtain equation (34). Denoting the error by the fraction 
E a given by the ratio of the neglected term to the exact factor within the brackets of equation (20) 
we obtain: 


x 2 +y 2 - (xxj + yy j ) 

e a = f(f + _ a> (35) 

To simplify our discussion we can express x, y, x x and y, in terms of polar coordinates 
r, <f>, r, , and <p i . The relations between these coordinates are given by the equations 


r 2 - x 2 +y 2 r , 2 = x j 2 + yj 2 


x - r cos 0 Xj - r cos^ 


y - r sin<£ y, ~ fj sinc/^ 


Substituting in equation (35) we obtain 


E«, 


r 2 - r r ( cos 4> cos 4> i + sin 0 sin 4> 1 ) 
f(f +d) 


(36) 


cos (0 - = cot<p cos <t> l + sin <t> sin^j equation (36) can be rewritten as 


r 2 - rr cos (<£ - 4> l ) 
f(f + d) 


(37) 
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Using the identity 


The cosine term can take values between minus one and plus one. Since we are interested in 
the maximum error, w.e will consider the case for coi(<£ -</>, ) = -I . Equation (37) can be rewritten 
for cosine equal to minus one as 


r 2 + r r j 

e a r 


(38) 



Figure 9 — Diagram of conditions for maximum E A . 

Figure 9 shows the relative positions of points (x^y,) and (x, y) for the case cos(0 - 4 > x ) = —1. 

As shown by the figure, the maximum error defined by equation (38) applies to the light contribu- 
tions from points (x 1? yj) located on the line of intersection 0 L T between the planes F and Q . The 
plane Q is a plane containing the optical axis and the point (x, y) in the back focal plane F ' . The 
points (x 1( yj) are further restricted to the portion of the line of intersection of F and Q which is 
on the side of the optical axis opposite from the point (x, y) . From the geometry c f the figure it 
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is clear that the angle <f> is equal to n + <£, . Thus <(> — <f> t is equal to n and cosine ( <p — 0^ = 

cosff - — I as required for the maximum F, A given by equation (38). For any point in the F plane 
which does not fall on the line Q,T the cosine term will be greater than -1 and the value of K A will 
be less than that given by equation (38). 

Examination of equation (38) shows that the error E A increases as r and r, increase. There- 
fore, to determine the maximum value of E A as a function of r and r, we need only to specify the 
maximum values of r and r, . Conversely, if we are interested in restricting the value of E A to 
be less than or equal to a specified value the maximum values of r and r, must satisfy equation (38) 
for that particular value of e a . 

In order to analyze the relation between maximum E A , r, and r, we must consider the inter- 
dependence of the maximum values of r and r, due to the limitations of a finite lens aperture. In 
our discussion, we will assume that diffraction effects at the end of the lens aperture are negligible. 







Figure 10 shows the extreme rays which can pass through a lens aperture of radius R L to reach the 
points at the distance r max from the optical axis. It should be apparent that any ray parallel to, 
but above the upper extreme ray; or parallel to, but below the lower extreme ray will be outside 
the lens aperture and will not pass through the lens. Thus any signal point outside the ray defined 
by r, max in Figure 10 cannot contribute to both of the points +r max and ~r . For example, if 
the signal area extended upward beyond the r, tnax limit, the additional signal interval cannot con- 
tribute to the spectral point at + r max since the necessary light path will fall outside of the lens aper- 
ture. Under these circumstances the amplitude at the spectral point at +r nwx will not correspond 
to the entire signal but only to the interval below the +r lmax limit. From this example it is ap- 
parent that the r Imax limit given by Figure 10 defines the maximum signal interval over which every 
point contributes to the spectral points at ± r . 

The dashed lines in Figure 10 represent the extreme rays to a spectral point at a distance r 
which is less that r max . It is seen that the extreme rays for such a case define a maximum signal 
interval longer than that obtained for r max . This means that the signal interval defined by r lmnx 
increases as the spectral range of interest defined by r max decreases. Thus we see that for a 
given maximum frequency (i.e. r max ) the maximum value of the signal interval r lmax is limited 
by the lens aperture. 



Figure 11 —Geometry for relation between r mox and r, moK 


24 


f 


To derive an expression defining the relation between r max and r Jmttx we will use the geometry 
of Figure 11. This figure represents the upper extreme ray and the principal ray contributing to 
the spectral point at + r max . Since the extreme ray must be parallel to the principal ray, the angles 
0 are equal and we can apply the principles of similar triangles to obtain 


r r 4 

max l max 

_ - — — 



a + d 


From these relations we obtain two equations for a : 



Since the right hand sides of these equations must be equal, we obtain the result 


__ . max 

r ...« “ - d T" 


(39) 


A lens is usually specified by its F stop which is defined as 


2 R, 


(40) 


Dividing both sides of equation (39) by f we obtain 


1 max 


f 


d r max 

T T~ 


(41) 


From equation (40) we find that = L_ and substituting into equation (41) we can write 

f 2F 


lmax 

___ 


d r „ 


2F " 7 f 


(42) 


It is obvious from equation ( 42 ) as well as Figure 10 that i-j max cannot be greater than the lens 
aperture radius r . Equation (42) defines the maximum allowed signal aperture radius r 


1 max 


due 


to the limitations of the lens aperture. In practice the size of the signal aperture is specified by 
physical consideration or a desired size format. We can rearrange terms in equation ( 42 ) to define 
the maximum spectral term r as 




I r imnx 



( 43 ) 


Equation (43) defines the maximum allowable r for a given r, max as determined by the restriction 
of a lens aperture. By rearranging terms in equation (38) we can obtain a second expression speci- 
fying the limitations on r max required for an allowed error E A . 


r 

max 

nr 



(44) 


t t i 

To demonstrate the application of equation (43) and (44) we will consider the case for — =-r 

P J, ^ 

(e.g. for f = 100m m, r. = 20mm ). In Figure 12, we have plotted — as a function of for the 

f J 1 I 

specified input aperture, - * ™ ax ~ j . The curves labeled F = 1.4 and F = 2 correspond to equa- 
tion (43) for the specified values of F . The curves labeled E A = 0.02, E A = 0.01 , and e a = 0.005 
correspond to equation (44) for the specified values of E A . The F curves specify the upper limit 
on j due to the lens aperture and the E A curves specify the upper limit for a given accuracy of the 
approximation. For a chosen value of ^ , the value of must be below both the F and E A curve 
which apply to the particular system being considered. 


For example, consider the case in which a lens with F = 2 is to be used and the maximum error 

to be allowed is E A = 0.02 . The greatest value allowed for ^ corresponds to the point a which is 

the intersection of the F = 2 curve and the E A = 0,02 curve. The value of 1 = 0.5 would be selected 

to obtain the value — 1 225. = 0>1 . For any other value of j- the limit on I™* would be less than the 

f d f r 

maximum at point A . For r less than 0. 5 the E. = 0.02 curve specifies a tighter limit on -223* 

while for j greater than 0, 5 the F = 2 curve limits . Of course any combination of and j~ 

corresponding to a point below the curves is allowed; the curves only define the upper limit on 5]p 

for a given value of 7 . 

The specification of a desired value of the maximum error E A is not readily determined in prac- 
tice. Since the error in the contribution from (x J( y x ) varies from point to point, the total effect 
of the error can not be determined unless the integration of the exact expression given by equation 
(20) can be evaluated. To circumvent this difficulty we will consider a more or less logical selec- 
tion of parameters and determine the maximum E A specified by these parameters. The value of 
E a determined will then specify the maximum error in the approximation we are considering. 

In Figure 12, the dashed line corresponding to = 0.14 represents the limit determined in 
the last sectio ■ for an accuracy of better than 1% in the linear relation between spectral frequency 
and back focal plane coordinates. From our previous discussion of equation (38) we note that the 
error e a decreases as j- increases. Therefore, if we do not wish to lower the previous limit of 
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r (I 

=0.14 , we can improve (decrease) the error B A by using the largest possible value of T • 

I cl 

Referring to Figure 12, we can note that for a lens with F = 2 , the maximum value of r is 0.3 

r 

which is given by the intersection of the dashed line y 0, 14 and the F = 2 curve. As noted on 
Figure 12, the value of E. at this point is 0.037 (from equation (38)). If we considered a lens with 
F = 1.4 ‘ , Figure 12 shows that we can increase to a value of 1. 1 and reduce the error to F A _ 
0.023. Thus we obtain the usual result that the lens of lower F provides the better characteristics 
(lower F implies larger lens aperture for given focal length). In addition to having the larger error 
E a the F = 2 lens restriction of y=0,3 presents practical problems— the lens mount and input 
aperture mount must be designed to allow for a small spacing (d = 3 cm for f = 10 cm) . 



f 


Figure 12— Limitation on maximum spectral term r mox /f. 
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In practice lower values of and f may be satisfactory. In such cases the error E A 

would be less than the e a 0.023 determined here and higher F lenses (smaller lenses) may be 
used. Here we have considered an extreme case and determined what amounts to an extreme error 
e a 0.023 (or 2.3%). This extreme value of the error introduced by the Fourier transform ap- 
proximation is quite reasonable and should be sufficient justification for using the transform ap- 
proximation in most applications. 


In the next sections we will consider the phase of the light distribution in the back focal plane 


F' . It will be shown that selecting t « l has advantages in reducing the phase factors not as- 
sociated with the Fourier transform. In Figure 12 the point corresponding to j 1 and 
0. 14 is shown to have an error value E A 0,024 . Thus it is seen that for the extreme case con- 
sidered in the discussion above reducing the value of 7 by one - tenth increases the error by 0 . 001 . 
Such a slight increase in the error E A is quite reasonable in terms of the advantage gained in tile 
phase approximation treated in the next section. In addition, since the location of a plane at a 

value of t can never be completely accurate, the displacement from the F 1,4 curve allows a 
' cl 

safety margin of + 10% allowable error in the location specified by 7 ~ 1 without exceeding the 

limitations imposed by the lens aperture. 


Thus we have shown how equations (38), (43), and (44) can be used to determine and/or specify 
the parameter limits and the accuracy of the Fourier transform representation: 


A(x, y) 


_j e ikR(x,y) 

T(f + d) 


y,)e ■W'O dx-j dy t 


(45) 


In particular we have shown that for the maximum values 


' mfi x 


0 . 14 and 


I m a x 

~F~ 


l 

5 


and the de- 


j - U( 1<t f £ J 

sirable choice of 7 = l , the worse possible error in the amplitude values given by this equation is 
2.4%. Since the term neglected in equation (20) is negative, the approximate amplitude given by 
equation (45) will be higher than the exact values by no more than 2. 4%. 


FOURIER TRANSFORM REPRESENTATION OF OPTICAL IMAGING 
By limiting the area of consideration in the input plane 

/ r 1 max l\ 

(fT- < 5 ) 

and in the back focal plane 

/ r max \ 

vr- < •“) * 
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we have shown that the light amplitude distribution A(x, y) in the back local plane of a lens is given 
with reasonable accuracy by 


where 


A( x, 

y) 

O' 1 cl) 

F(p. 

Cl) 

(46) 

P 

X 

TF 

q 

Jj 


(47) 

y) 

f 2 > (If f 

X 2 C y2 

1 

'2 t (|f t r 2 



’ y 2 ) 1 2 

i 


(f 2 * r 2 ) 1 2 

(48) 

C|) 

j> (*, 

' y>)° 

i2«(px | 

l+<)y ^ dx, dy, 

(49) 


As given by equation (49), F(p, q) is the two - dimensional Fourier transform of the light amplitude 
distribution A' (x,, y t ) in a plane perpendicular to the optical axis and at a distance d in front of 
the lens. 

As pointed out in the discussion of equations (22), (23) and (24), the phase term e ikR <*' y > 
is of concern only when a second lens is introduced to produce an image as shown in Figure 13.* 



Figure 13— Two fens optical imaging system. 


♦We are considering only conventional optical systems here. If holographic techniques are considered, the phase factor in equation (46) 
would determine the form of the interference pattern produced by A(x, y) and a reference signal. 


29 


i 


To simplify our notation we introduce K defined as 


-i £ ,ikfU;x,y ) 

~T : d')‘ 


and write equation ( 46 ) as 


A(x, y) KF(p, q) 

In Figure 13, the light amplitude distribution A' (x,, y,) in the input plane F is given as sex 
Using equation (49), (50), and (51), the light amplitude distribution A(x, y) in the plane F 
focal plane of lens L, ) is given as 


A (x, y) 


k, y.) o dx , dy , 


where 


x y 

p " kf , * - 


K 




ikR . (x,y ) 

M f , td .) 


f J 2 + d l + r 2 


R i (*’ y ) 


Similarly, A(x, y) is the input signal to the lens L 2 , and the light distribution 0(x 2 , y 2 ) i 
output plane F" (back focal plane L 2 ) can be written as 


°(x 2 , y 2 ) = K 2 |Ja(x, y)e-^<P^^y) dxdy 


where 


(58) 


K 


2 


„ u , ikR 2(*rA) 

(V “0 


R 2 (x 2 , y 2 ) 


(<7 * r 2 2 )' 2 


(59) 


Substituting (52) into (56) we obtain an expression tor the output image 0(x 2 , y 2 ) in terms of 
the input image S('x,, y,) 


°(x 2 . y 2 ) *• K 2 X+!,y ) dxd v| K , JJ s(x,, y,) < f , 2 w ( px i +w i) dx ( dy ( 


(60) 


We will assume that the function S(x t , yj) allows the order o;i integration to be reversed and 
rewrite equation (60) as 


f 



i 




y,) dx, d y 



^-ia-TT^px j + qy j+p'x+q'y ^ 



(61) 


The integral within the brackets is complicated by the presence of the factor K t which contains an 
exponential dependent upon x and y . If we limited the values of x and y (i.e. r/f ) so that the 
phase variation in K y can be considered negligible, the K, factor can be taken outside the integrals 
giving 


°( x 2- y 2 ) 



y,) dXj dy 



(62) 


We now consider the integral within the brackets and substitute for p , q, p'and q' from equations 
(53) and ( 57 ) 


jJ e “ i2Tr ( pXl+qy ' +P ' x+q ' y ) dx dy = J* e " i2OT ( x 


,Af i +x,Af . 


)’ 


dx^e 


2^(y ,Af ,+y jAf 2 y 


dx 


(63) 


Up to this point we have not mentioned the limits of integration. Due to the presence of aperture in 
an optical system the signals exist only over a finite range of the aperture coordinates. However, 
since the signals are zero outside this range (e.g. S(x,, y t ) = 0 outside the aperture in the f 

plane) the contribution to the integral beyond the aperture limits will also be zero. Thus, we can 
take the limits of integration to be from to +». These limits are in agreement with the Fourier 
transform integrals. 
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The Dirae delta function can be defined by the integral equation: 


r 


S(x ~ a) ~ I d /./ 

“00 


( 64 ) 


Comparing each of the integrals on the right side of (63) with the integral in equation (64) we find 


JJ e -, W( p.,* w , t p.»^ )djidy „ s(^ + ^)s(^ + ^) 


(65) 


- f 


i 5 ( x i + r 2 x 2 ) s (yi + r 2 y 2 ) 


In the last step of equation ( 65 ) we have used the identity 


S ( ax ) = TaT S ( x > 


Equation ( 65 ) is valid only if r and y range from -® to + 03 . In optical systems this is not the case 
Since the range of the coordinates x and y is limited as demonstrated in our previous discussion. 
However, we assume ( 65 ) valid to simplify our discussion. Substituting equation ( 65 ) into equation 
( 62 ) we obtain 


0(x,, y 2 ) = K 2 Kj A. 2 f 


i 2 JJ s ( x i* y ») s ( x i + t ~ 2 x 2 ) s (yi + y 2 ) dx i d yi 


(66) 


Now, we can make use of the sifting property of the Dirac delta function which is defined by 

J*F(x) S(x + a) dx = F(~a) 

Applying this property of the delta function to equation (66) we obtain 


°( x r y*) 




(- 


f i \ 

r 2 x 2 y 2 ) 


(67) 


In deriving equation ( 67 ) we assumed that Kj was approximately constant. The factor K 2 is variable 

only in phase as seen by referring to equation (58). Since only intensity is seen or measured, the 

♦ / . . 

phase variations of K 2 can be ignored and equation ( 67 ) can be interpreted as giving the output image 
0(x 2 , y 2 ) in terms of a proportionality factor (K 2 K t \ 2 f 2 ) multiplying the original input signal S 
expressed in the new coordinates 


(- t; S' - T7 y a ) 


To clarify the significance of these new coordinates, let us consider the relation between 


s(x,, y ,) -• s(- ^ " rj s) 


(68) 


Since the two sides of relation (68) correspond point for point, we find that (x,, y,) and (x 2 , y ) 
coordinates are related by 


t: * 2 




t 2 y 2 


(69) 


Equation (69) represents the fact that a point of the signal which was originally at the coordinates 
Xj and y, will be imaged to the point at 


Xj and y. 


-T7 y , 


The magnification in an optical image is defined as the ratio of the imaged coordinate of a point to 
the original coordinate 


m 


m 


(70) 


Equations (70) were written separately although it is apparent that here the magnification is the 
same in any direction. In some cases it is possible to obtain different magnifications in different 
directions (e.g. cylindrical lens system). Equations (67) and (70) show that the output .image is 
proportional to the input image with a change in scale. The minus sign which appears in equation 
(67) and (70) represents an inversion of the image. 

In many applications, there is no requirement for a magnified image. In such cases, we could 
use lenses of equal focal length f j = f 2 and obtain a magnification m = - l . 

For the case f 1 = f 2 = f , equation (67) becomes 


°(v y 2 ) " K 2 K t f 2 s (-x 2 , -y 2 ) 


(71) 
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Thus, for equal focal length lenses the output image 0(x 2 , y 2 ) is proportional to an inverted re- 
plica of the input signal. 

It is for this case (f, = f 2 = f ) , that the optical imaging process can be described as con- 

secutive Fourier transforms. This can be shown by replacing f 2 by f in equation (57) and sub- 
stituting for p' and <i' in equation ( 56 ) to obtain 


0 ( x 2 * y 




A(x, y) e 


■i27r(K 2 *Af + y 2 yAf) ^ 


by replacing f l by f in equation ( 53 ) we find 


x y 

p - xr ’ * - ki ' 


dx = X.f dp , dy “ \f dq 


Substituting (73) into equation (72) we obtain the result 


°( x 2 ’ y 2 ) ~ K 2 Jv2 f2 )| A ( X ’ y) e l 27 r ( p!, 2 +qy 2 ^ dpdq 


(74) 


Using equations (51), (52), (71) and (74) we can express the two step process of optical imaging as 

A(x, y) = K 1 F(p, q) = K » ]} S ( X l ’ y l) 6 '*** ^ d Xl dy, (75) 

°( x 2 . y 2 ) = K 2 K, \ 2 f 2 S(-x 2 , -y 2 ) = f 2 F(p, q)e' l2Tr(PX2 + qy2) dpdq (76) 

where 


p " Xf ’ 


q - XT 


. ikR « ( x , y ) 
-ie 1 

M f + d >) 


ikR 2 (x 2 ,y 2 ) 


f 2 + fdj + r 2 


f 2 + fd 2 + r 2 2 


R ! ( x » y) (f 2 +r 2 )l /2 ^2 ( x 2 ’ y 2 ) [f 2 + r 2 ^ 


-2 - x 2 + V 2 


r 2 - x 2 + v 2 
2 2 y 2 


The last expression in equation (76) assumes that the factor x x can be considered constant in 
phase over the range of the values p and q (i.e. x and y ). This approximation is the subject 




we are about to consider and here v/e arc showing the advantages of the resulting expression. To 

appreciate the significance of equations (75) and (76) let us consider the standard Fourier transform 

♦ 

equations using our notation 


F(p, q) J 

[s( V y, da.dy, 

(77) 

s (x,,y,) * 

j*J" Ffp, q) e i +<ly i) dpdq 

(78) 

s (" x i* _y i) 

*= j” f Ff p , q) e’ iJ ” l '”' |, '' y, Olp<lq 

(79) 


Equation (77) is usually referred to as the Fourier transform while equation (78) is the inverse 
Fourier transform. Note that the exponent of equation (78) is positive and that of equation (79) is 
negative. Since the optical transform produced by a lens has a negative exponent, the inverse trans- 
form defined by equation (78) never appears in optical systems. The second lens in an optical system 
such as that of Figure 13 produces a Fourier transform of a Fourier transform as represented by 
equation (79). Note that the inversion (or change of sign of the coordinate) is introduced by the 
second Fourier transform whereas an inverse transform would not invert the signal. Thus, com- 
paring equations (75) and (76) with equation (77) and (79), we note that the optical imaging process 
of two lenses is described by two successive Fourier transforms relations. Except for determining 
the absolute amplitudes involved, the constants in front of the integrals of equations (75) and (76) 
do not affect the form of the variations. In most cases only the relative amplitudes are of interest 
and the constants are dropped. 

The advantage of the Fourier transform representation described by equation (75) and (76) can 
be shown by considering the introduction of a filter in the F' -lane. If we know the transmission 
characteristics of the filter, we can determine a function M(p,q) which represents the fraction of 
incident light amplitude passed at each coordinate corresponding to the values of p and q. The 
filtered output o f (x 2 ,y 2 ) is then given by equation (76) if we replace F(p,q) by M(p,q) F(p,q) 

°f ( x 2' y 2 ) = K 2 K 1^ 2 r ' jj M (P’ q) F (P- O' 2 ,27T ( pX 2 + qy 2) dp dq 

Thus, the specification of a filter for a particular application can be determined uniquely when the 
Fourier transform representation is used. 

We might note at this point that the Fourier transform representation of equations (75) and 
(76) require the use of lenses of equal focal length. The specification of a filter for the case of 
unequal focal lengths is exactly the same; however, the Fourier transform relation of equation (76) 
is modified by introducing a factor o. f ,/ f 2 i" the exponent to account for the magnification. 


This additional f } /f 2 results in a magnified filtered image which contains the same information 
as the filtered image o f (x 2 ,y 2 ) given by equation (80). The only difference (neglecting the phase 
of K 2 ) is in the scale. Throughout the remaining part of this report we will consider the special 
case of equal focal length lenses to simplify our analysis. 


ELIMINATION OF UNDESIRABLE PHASE VARIATIONS 

Now that we have seen the significance of the Fourier transform representation of optical 
imaging, we will consider the approximation involved in the derivation of equation (76). The actual 
relation corresponding to equation (76) can be written as 


°( x 2 , y 2 ) = K 2 \ 2 f2 JJ K, F(P. q) e' i2 ” (pX2 + qy2) dpdq 


(81) 


The term K, appearing in the integral was defined as 


. ikR j(x,y) 

M f + d i) 


(82) 


where 


f 2 + fdj + r 2 


R 1 ( x > y) + r 2j 1/2 


(r 2 = x 2 + y 2 ) 


(83) 


Comparing equations (81) and (76) it is apparent that equation (76) is valid only if the phase vari- 
ation of Kj can be neglected over the range of the integration variables p and q (or x and y ). 

We will now proceed to analyze this requirement. 

For reasons that will become evident, we would like to express R, (x, y) in the form of: 


R, (*. y) s (f +d t ) + P(f-dj) + Qf 


(84) 


f(l +P+Q) + dj (1-P) 


Equation ( 83 ) can be rewritten as 


R ! ( x > y) 


f 2 + r 2 d i f 

+ ttt rrrr: - f 


( f 2 +r 2jt/2 (f 2 + r 2)l/2 


(» + 73) + d . (* + 73) 


•1/2 


(85) 
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Equating the coefficients of f and d in the final forms of equations (84) and (85) we obtain 


l - P 


l + P + Q 



Solving for P and Q we obtain 


P ~ 



Q :: 




2 = 



Substituting for P and Q, equation (84) can be written 


R 1 ( x - y) 


( f+d .) * ( f - d i) 




l 


The K term given by equation (82) can be rewritten using equation (86) 


( 86 ) 


llcf f-d.)[l-( l + r 2 /f 2 )' 1/2 ] i2kf [( l+r2/2f 2)( J+ r 2 / f 2 )“ l/2 ~ l] 

e e 

(87) 

The terms grouped within the first brackets of equation (87) are constant and therefore can be 
taken out from under the integral sign in equation (81). The remaining exponentials in equation 
(87) are phase factors which depend on the variables of integration. The exponential of the re- 
maining terms must be limited so that the phase variations can be considered negligible. 

r 

Since we are going to consider restrictions on the value of j so that the phase terms can be 
considered negligible, we can simplify the exponentials of equation ( 87 ) by expanding in power 
series of and drop all but the first terms of the expansions. Expanding the exponent of the 
first exponential we obtain 

e ik(f-d 1 )[l-(l + r 2/f2)‘ 1 / 2 ] _ ^ik(f-d ] )[l~{l-(r2/2f2)+3/8( r 2/f2) 2, • *}] 

ik(f— d.)(r/f ) 2 /2 

- e 
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K ! = 


• ik ( f+cl i) 

•ie 1 


i~H f d i) J 


t 


Similarly, we expand the second exponential and obtain 


i 2kf[< l+r 2 /2f 2 K l + r2/f 2)“ ,/2 -l] _ i 2k f [~I+(l + r 2 / 2 f 2 ){l- ( r a / 2 f 2 ) + 3 / 8 ( r 2/f 2 ) 2 - • • }1 

e - e L J 


~ e ikf(r/f) 4 /4 


Substituting these approximate terms in equation ( 87 ) we obtain 


K 


i 


-ie‘k( ftd l 2 ik( f-diKr/f ) 2 /2 ikf(r/f) 4 /4 

\(f +dj e e 


( 88 ) 


For values of (j)less than our previous limit of 0. 14, t.r approximations in each of the phase terms 
is accurate within 2% of its exact values. It shall be noted that neglecting terms in the exponentials 
as we have done here is valid only since we are going to consider phase variations less than one 
cycle. 


Since we are trying to eliminate the phase variations of K t , equation (88) indicates that the 
optimum choice of the distance d 1 equal to f eliminates the first phase term. Thus for the case 
when dj is chosen equal to f , equation ( 88 ) can be reduced to 


K. 


ie 


i 2k f 


2Xf 


ilk f ( r/f ) /4 ^f or ^ - f j 


(89) 


If we substitute equation (89) for K x in equation (81) we obtain 


°( x 2’ y 2 ) = ” — 2 jje ikf < r/f ) 4 / 4 F(p, q) e l2 ^ PX;2 qyj ^dpdq 


(90) 


When the value of (jJ is limited so that the phase factor appearing under the integral of equation ( 90 ) 
can be neglected, the output image 0(x 2 , y 2 ) is given by the equation 


it 


0(x 2 , y 2 ) = Kj K 2 A. 2 f 2 q) e” 1 


27T(px 2 +qy 2 ) 


4>dq 


(91) 
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where 


K 


i 


-i t> 


i 2 U l' 


2 .V f 


Equation (91) is identical to equation (76) which we have shown to be the desired form for the 
Fourier transform representation of optical imaging. 


To derive a specification for the maximum limit on (jj which allows the variable term in K t 
to be neglected, we will consider the effect of the phase term for a particular F(P, q) . 


F(P> q) “ »(q) [A 0 b(p) + ^{s(p- p 0 ) + §(p+ P 0 )J 


(92) 



Figure 14— Location of frequency terms in the specitrum plane. 

The locations of the frequency terms contained in equation (92) are diagrammed in Figure 14. 
Since, by definition, the delta function S(q) is equal to zero for q unequal to zero, equation (92 ) 
represents the spectrum of a signal which varies only in one - dimension. That is, there are no 
frequency components in the y direction; therefore, the signal is constant with respect to the y 
coordinate. Rather than interpret equation (92) as the spectrum of a particular signal, we can 
also assume that we are considering only three sample points of a more general spectrum. Since 
there is nothing to single out the x direction in an optical system, our analysis will apply to a set 
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of spectral points along any radial axis in the frequency plane as indicated by the r axis in Figure 
14. This is obvious if we consider the fact that we can arbitrarily select any orientation for our 
x, y coordinate axes. It can also be shown that the terms which we will use to specify a maximum 
limit on (f) also apply to the general case. We will, therefore, simply interpret the results for the 
special case of equation (92) as a general criteria for neglecting the undesired phase factor in 
equation (90). 

Substituting for F( P , q) as given by equation (92) and applying the interpretation of our discus- 
sion above we can simplify equations (90) and (91) to the forms: 


°( x l) = K J« ikfc,</f>4/4 [a 0 S(p) + § {«(p-P 0 ) + Mp + Po)}] e " i27,pX2 d P 

°(x 2 ) * k|[a 0 S(p)+-| {s( P "P 0 ) + S(p + P 0 )}] e' l2Wp * 2 dp 
The new factor K in equation (93) and (94) is defined as 

-iK 2 k f e i2kf 


The integral with respect to q was taken by applying the sifting property of the delta function S(q) . 
The y 2 dependence has been dropped on the left side of equations (93) and (94) since the output 
image varies only with respect to the x 2 coordinate. The left side of equation (93) is underlined so 
that we can identify the ensuing results of equation (93) and (94) throughout the remainder of our 
discussion. 

The first exponential in equation ( 93 ) can be rewritten using the definition p = jj and the 
integration with respect to p is performed simply by applying the sifting property of the delta 
function: 


( 94 ) 


( 95 ) 


°( X 2 ) " K Je^P) 4 / 4 [a 0 8(p) + | |s(p+ p 0 ) + s(p- p 0 )jj e * 27rpX 2 dp 
= k[a 0 + | e ikf(Xpo)4/4 {e i2Vp °*2 +e _i27rP ° X2 }] 

°( x 2 ) = k[a 0 +Be lkf(Xp0)/4 cos2 7rp 0 xj 
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The result after integrating equation (94) is similar to equation (96) except for the exponential 
which appears in equation (96): 


°( x 2 ) K[A„ ♦ B cos 2 p„ x 2 ] ( 97 ) 

Equations (96) and (97 ) represent the amplitude distribution of the output image produced by the 
frequency plane distribution Fcp, q) given by equation (92). Equation (96 ) represent the output 
0( x ) when the phase factor is considered mid equation (97) represent the output 0(x ) when the 
phase factor is neglected. Comparing equation (96) and (97), a criteria for neglecting the phase 
factor is still' not very apparent since the significance of the exponential is not very clear. 

If we consider the observation of the output image, we must deal with the intensity rather than 
the amplitude as given by equations (96) and (97). The intensities are given by the relations 


f(x 2 ) ; 


!°h)i 

; 2 

(98) 

I ( X 2) 


l°h)l 

2 

1 

(99) 


The starred terms in equation (98) and (99) represent the complex conjugate of the unstarred terms. 
Using equations (96) and (97) in equations (98) and ( 99) respectively, we obtain 


[ k f "1 

_____ +B 2 cos2 2r/p 0 x 2 +2A 0 Bcos (Xp^ 4 cos27rp 0 x 2 J ^ 

l(x 2 ) = |k! 2 [A q 2 +B 2 cos 2 2/r p Q x 2 + 2A Q Bcos 2?Tp 0 x 2 ] (101) 

Now comparing equation (100) and (101), we find that the phase factor introduces a cosine factor 
which attenuates the cos 2??p 0 x 2 component in the observed image. In the general case, the ampli- 
tude B of any one component will be considerably less than the component A . Therefore, the 

third term in equations ( 100) and ( 101) represent the larger of the two x 2 dependent terms in the 

k f 

image intensity. It is desirable to limit the maximum value of p Q to obtain a value of cos T (^P 0 ) 4 
as near to one as possible so that the Fourier transform representation used in deriving equation 
( 101 ) can be considered a good approximation. 
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We can express the cosine term as a function of x by the definition p jj and since our 
results will apply to the general case we can replace x by the more general notation r . Thus we 
can express the cosine term as 


COS 


kf /, v 
T V‘Pq) 


kf 

COS -jf 



( 102 ) 


In equation ( 102 ) the frequency p 0 is given the general interpretation of a spatial frequency in the 
direction of an r axis (see Figure 14) and p 0 and r are related by p 0 y-j . That is, equation 
( 102) applies to the general case of a spectrum along any radial axis r in the back focal plane F' 
We can express (f) as a multiple of ($r) 4 by defining a factor m by the relation 


r 

7 



(103) 


Substituting for j in equation ( 102 ) we obtain 


kf 

cos 



“ COS ITl 4 


( 104 ) 


/ 77 \ * 7 4 

We can recognize the ultimate limit on m by noting that for m = ^ j the cosine term as given by 
equation ( 104) is zero (i.e. cos \ - 0 ). For this value of m the Fourier transform result of 
equation (101) is completely in error with respect to the third term, since the cosine term present 
in equation ( 100) is zero and the third term is eliminated. Thus for m - 4 our ideal two step 

Fourier transform representation yields a term which does not exist in the actual image given by 
equation ( 100 )* t For values of m less than (- 3) 17 4 the cosine of equation ( 104) has non - zero 
values as shown by the curve of Figure 15. Selecting a lin.it on m is based on specifying how accu- 
rately the third term of equations ( 100) and ( 101) should agree. A value of m = 0 is necessary to 
have complete agreement between equation (100) and (101); however, m = 0 corresponds to a 
frequency p 0 : 0 which corresponds to a dc term only. Thus a compromise limit must be estab- 
lished between ihe limits m = 0 and m = ^) 1/4 . 


To determine the limitation on m , it is necessary to specify the des ' 1 >d accuracy of the third 
term in equation ( 101) as compared to the third term in equation ( 100). Again, the accuracy of 
our approximation can be given in terms of a fractional error defined as 


1 - cos m 4 


cos nr 


cos nr 


- 1 


(105) 


•This accounts for the frequent neglecting of phase terms only when less than rr/2 which appears in many references dealing with ap- 
proximate solutions of diffraction problems. 
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Figure 15— Graph of cos m 4 vs. m. 


The error in the third term of equation ( 101) is then + 10 O E.% compared to the exact term in 
equation ( 100), Note that the error E,. does not represent a fraction of the image intensity. 

The error E^ corresponds only to a particular term in the image intensity. In tlie general case there 
would be a series of such terms and the maximum E^ would be determined by equation ( 106). This 
E^ would represent the maximum error in terms of the form of the third term in equation ( 101 ) 
and would correspond to the term involving the highest frequency of interest. 


Specifying the maximum allowable error is rather arbitrary and will usually depend on the 
particular application considered. However, for an example we can consider specifying a limit 
of 2% accuracy for our approximation, i.e, E^ ~ 0.02 . Substituting in equation (105) we obtain 
the condition for the maximum value of m : 


cos m 


4 > 


1,02 


0.98 


(106) 


From Figure 15 we find that the relation ( 106) requires values of m less than or equal to 0. 67. 
Using the maximum value m = 0,67 equation (103) becomes 



(107) 
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r - 



l 


f 

To obtain a numerical result for comparison with our previous limit j mox “ o. 14 , we will again 
consider a wavelength A. = 54151 * 10 ' 8 cm and focal length f " 10 cm. Substituting in equation ( 107 ) 
wfj find 



= 0,67 



0,67 


( 4x 5461 * 10‘ 8 V /4 
\ 2rrx 1(T / 



- 0,03 


(108) 


Equation ( 108) specifies the aperture limit in the frequency plane F' to assure an error limit of 
less than 2% due to neglecting phase variations. We note that this phase limit restricts the frequency 
aperture to approximately one - fifth of the previous value of j max = 0 . 14 which was sufficient 
for the linearization and amplitude approximations. The corresponding frequency limit is given as 


p 


max 


= * (') s 0 03 - 

K ' f /mnx 5461 x 10” 8 


550 cycles/cm 


(109) 


Thus for spatial frequencies less than 550 cycles /cm, neglecting the phase term to obtain the Fourier 
transform representation of equation (91) introduces an error of no more than 2% in terms 0 the form 
of the third term in the image intensity of equation ( 101). Again we can point out that for most prac- 
tical cases the frequency capability of present input techniques restricts the possible frequencies 
to a lower value than that specified by equation ( 109). 

We have shown how equations ( 103) and ( 105 ) are used to determine the error E 0 for any 
frequency plane aperture with a radius defined by (^p) . Further we have shown for a particular 
case (\ = 5461 A and f = 10 cm) that the limit (j) may - 0.03 provides an accuracy within 2% for the 
terms in which the phase variation appears. It has also been noted that this phase approximation 
requires a tighter restriction on the maximum frequency terms. In fact, for examples used the max- 
imum frequency is one - fifth of that allowed for an accurate amplitude approximation. Of course, 
this further restriction of the frequency range of interest will also improve the accuracy of the 
amplitude approximation. 

d 

We can refer back to Figure 12 to consider the amplitude error E A for the values j = 1 and 

= 0.03 assuming = j. The point corresponding to f = 1 and — p = 0.03 is located 

below the curve corresponding to E A = 0 . 005 . Therefore, the further restriction on ^“p required 
for the phase approximation reduces the error in the amplitude approximation to a value less than 
0. 5%. This result shows that the restriction we have considered in this section not only provides 
a Fourier transform relation which is accurate in phase, but also improves the accuracy of the 
amplitude approximations previously considered. 
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Within the limits presented in this section, the two -lens optical imaging process can be described 
by the two equations: 


a/ \ - ~i e ‘ 2kf r , , ~ie‘ 2kf | ff , > -i27r(p)( +qy ) /iin\ 

A ( x ’ y) |_2 A f J F (P' <0 L 2A f Jjj s lV y i) ° dx ! d yt (110) 


°( X 2* y 2 ) # G 


IkR , 


- fo i2k f 1 ff 

L 2 ( f +d Dj JJ 1 


F(p, q) e 2 < ' P) ' 2 IY2 )dpdq 


(HD 


In practice only the variations in amplitude are of interest and the constant factors within the brac- 
kets are dropped 


A( x , y) 


F(P, q) 



\ "I2w(px +qy ) 
yje 


dXj dy. 


( 112 ) 


°( x r » 2 ) 



ik R , 


If 


F(p, q) e 


M2w(px 2 +qy 2 ) 


dp dq 


(113) 


Equations (112) and (113) represent the form of the optical Fourier transform representation com- 
monly used. These equations describe the relative amplitude and phase variations of spectrum 
A(x,y) and image 0(x 2 , y 2 ). Note that the phase term e ikR2 is retained in equation ( 113 ). This 
factor has no effect on the image intensity since multiplication by the complex conjugate eliminates 
this term. However, if the image 0(x 2 , y ) is to be processed further by another lens, the effect 
of the phase factor e ,kR 2 must be considered. In such cases, our criteria for neglecting the vari- 
ation in phase due to the factor ® lkR 1 must also be re - evaluated since the criteria developed above 
was based on image intensity effects. 

OPTICAL CORRELATOR SYSTEMS 

We will now consider a three lens optical system as shown in Figure 16. In this systeip the 
signal plane F is assumed to be in the front focal plane of lens L l and each of the other lenses 
(L 2 and L 3 ) is located so that its front focal plane coincides with the back focal plane of the pre- 
ceding lens. With this configuration the amplitude distribution corresponding to the input signal 
to each lens is the output signal in the back focal plane of the preceding lens and is a focal length 
in front of the lens. This location of the signal planes provides the advantage of eliminating the 
phase terms dependent upon the distance fron the lens to the input plane as discussed in relation 
to equations (88) and(89). The optical system in Figure 16 consists of a two lens imaging system 
as discussed irvthe preceding section followed by a third lens which produces a Fourier transform 
of the light amplitude distribution of the image. As pointed out at the end of the last section, the 
processing of the image 0(x 2 , y 2 ) by an additional lens involves its amplitude rather than the in- 
tensity; therefore, the phase effects of each lens will be considered. Throughout this section we 
will assume that the aperture limitations are sufficiently restrictive so that the linear frequency 
and amplitude approximations developed earlier are valid. The focal lengths of the three lenses 
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' Figure 16-Optical correlator system. 

!| 

are assumed equal to simplify our analysis; in the general case, unequal focal lengths would intro- 
duce magnification or demagnification. 

In the optical system of Figure 16, lens l, produces a light amplitude distribution in its back 
focal plane f' which is proportional to the Fourier transform of the input signal S(x x , y x ) except for 
I a multiplicative phase factor. As discussed in reference to equations (112) and (113) we will drop 
| all constant factors and retain only terms which vary with respect to the coordinates in the four 

I signal planes of interest (f, f\ F", F 0 ). Using only the variable exponential in equation ( 89) for 

K t , the amplitude in the F' plane is given by equation (75) which can be written as 


A(x, y) 


»ikf(r/f) 4 /< 


F(p, q) - e ikf < r/f ) 4 / 4 jjs(x x , y^ e ' 2 ^ PXl+qy ‘^ckj dy x (114) 


l For our development of a correlator it is advantageous to introduce notation for the signal s^.yj) 

j which accounts for displacement of the signal from some reference position. Referring to Figure 

f 17, we can consider the displacement of a signal point A to the new position A' . If A is a point of 

| the signal S(x x , y x ) , the light amplitude at A is S(x A , y A ) . Since A' is the same signal point as 

| A (it has only been moved), the light amplitude at A' must also be S(x A , y A ) . The coordinates of 

| the point A' are x x = x A + r x and y x = y A + r y . Thus our notation for the signal must be such 

I that if we substitute the coordinates x Jf y x for the point A' we obtain S(x A , y A ) . The required nota- 
tion is S(x x - lfc . y x “ r y ) as can be seen by substituting the values of x x and y x for each of the 
points A and A' . In either case the signal amplitude is S(x A , y A > . Using this new notation for 
I a signal, equation (114) can be rewritten as 


A(x, y) = e ikf ( r/f ) 4 / 4 F(p, q) 


e ikf(r/f )V '’fi 


V'jf- 







Figure 17— Displacement of a signal point in the input plane F. 

The displacements r x and r y are positive when the displacement is in the positive x t and positive 
y directions. 

In equation (115) the function F(p,q) represents, the Fourier transform of the displaced function 
S(x 1 - r x , y t - t ). Thus the F(p,q) in equation (115) includes all the information regarding the 
signal including its displacement. From Fourier transform theory, the transform corresponding to 
a displaced signal such as in equation (115) differs from the transform F(p,q) of the undisplaced 
signal of equation ( 114) by an exponential phase term, e " l 27 ’ r ( r x p+T y q ) . This principle need not 
concern us any further; it is pointed out only to emphasize that the F(p, q) in equation ( 115) corres- 
ponds to the displaced signal S(x 1 - r x , y t - r ) 

The amplitude distribution given by equation (115) appears in the plane F' and represents the 
input signal to the lens L 2 . Lens L 2 performs a Fourier transform operation on A(x, y) and the 
image amplitude in the plane F" is given by 

0 ( X 2’ y 2 ^ = ^( X 2’ y 2 )jj* e lkf ( r/ f )4//4 ' F (P , q) e l2 ”’( PX 2' ,qy 2) 


dp dq 


(116) 


Equation (116) corresponds to equation (81) except that only the variable terms of K ; and K 2 have 
been retained. The exponential appearing in the integrand corresponds to the variable term in k 
as discussed above. The function </<x 2 , y 2 ) in front of the integral represents the variable part of 
k 2 . From the definition of K 2 given under equation (76) the variable part of K 2 is obtained from 
the term e ,kR 2 where 


R 


2 


f 2 + fH + r 2 

*-■*":■* '■ Ti ‘"' '- 1 and r 2 ” x 2 + v 2 

^ f z+ r 2y/2 ana r 2 y 2 


(117) 


Since R 2 has the same form as R, , R 2 can be expanded in the form of equation (86) 


r 2 ( X 2 ’ y 2 ) = ( f +d 2 ) + ( f “ d 2 ) 


>rj 


+ 2f 


1 + 


2f : 


f r 2\l/2 


~ 1 


(118) 


The first term and the - 1 term in the last bracket of equation (118) are constant and can be dropped 
since we are interested only in the variable part. The second term vanishes since d 2 = f in the 
system we are considering. Thus the only variable term in equation (118) is the fraction in the brac- 
kets of the third term. The function <£(x 2 , y 2 ) is therefore given by 


0(x 2 , y 2 ) = 


(119) 


The variable part of K 2 given by equation (119) was derived from the complete expansion of R 2 
rather than from an approximate expansion analogous to equations (88) and (89), since the aperture 
restrictions necessary for the validity of (88) or (89) would require a signal and image aperture 
much smaller than that normally desired in optical systems. For an image aperture defined by 
(l^)max = 0.14 , and f = 10 cm, and X = S'Wl x iff 8 cm , the phase term y 2 ) can introduce 

phase shifts as great as 38 rr radians ( 19 cycles). It was pointed out that the phase approximation 
of equation (89) was accurate within For the image aperture considered here this phase inac- 
curacy can be of the order of 0. 4 cycles. This magnitude of phase error may not be negligible and 
therefore the more complete exponential was used in defining 4>(* 2 , y 2 ) by equation (119). 

Returning to the image amplitude distribution 0(x 2 , y 2 ) given by equation (116), we will 
change the notation to take into account the possibility of image displacement corresponding to the 
signal displacement considered previously. In the last two sections it was pointed out that the imaged 
amplitude 0(x 2 , y 2 ) corresponds to an inverted replica of the input signal S(x 1 , yj ). This inverted 
property of the image applies to the image motion as well. That is, if the signal is displaced in 
the positive x, and direction, the image is displaced in the negative x 2 and y 2 directions. Thus, 
if 0(x 2 , y 2 ) corresponds to the inverted image of S(Xj , y } ) , the displaced image corresponding to 
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S(Xj -t x , y j - r ) is obtained simply by reversing the sign of the displacement to obtain 0(x 2 + r x , 
y 2 + r ) . Using the displacement notation for the image amplitude distribution, equation (116) 
can be rewritten as 


°( x 2 + V y 2 +T y) ~ ^( X 2’ y 2 ) t/f F(p, q) 


“l27T(px 2 +qy2) 


dp dq 


( 120 ) 


The final lens L 3 in Figure 16 operates on the light amplitude distribution appearing in its 
input plane F" . For an optical correlator operation a reference signal R(x , y 2 ) is inserted into 
the plane F" in the form of an amplitude transmission function of a photographic transparency. In 
this case the light amplitude distribution operated on by lens L 3 is that which appears on the output 
side of fra reference transparency. This light amplitude is given by the product of the incident 
light amplitude 0(x 2 + r x , y 2 + v y ) and the reference transmission function f((x 2 , y 2 ) . Thus the light 
amplitude distribution W in the output plane F 0 is given by the equation 


w(> 


ikf( r 3 /f ) 4 / 4 


if 


R (v y 2 )°( x 2 +T , 


y +r 

J 2 y , 


-i27r(sx 2 +ty 2 ) 


dx 2 d y 2 (i2i) 


where 




2 - 


2 + v 
1 y q 


Since we are considering a system which terminates at the F 0 plane, the intensity will be detected, 
measured, or recorded in the f q plane. The intensity in the output plane is given by the product 
of equation ( 11, and its complex conjugate. The complex conjugate product of the exponential in 
front of the integral results in the cancellation of the exponential. Thus, we can drop the exponential 
in equation (121) since it will not affect the detected intensity output. Equation (121), therefore, 
can be simplified to 




! )°( 


X n +T 


V + T ■ 
y 2 f y I 


*i277(j 


SX 2 +ty 2 ) 


dx dy , 


(122) 


Finally, if we consider only the point located at the intersection of the optical axis with the plane 
F 0 (back focal point of L 3 ), s - t = 0 (i.e. x 3 = y 3 = o) and equation ( 122) reduces to 




y 2 )°( x 2 +T x- 


y 2 + r y ) dx 2 d y 2 


(123) 
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where 


W ( T X- r y) " W ( X 3 = O' y 3 ^ O' T x' T y) 

Equation ( 123 ) corresponds to a two - dimensional correlation function which implies that the light 
amplitude at the back focal point (x 3 = y 3 =0) of the lens L 3 is given by the cross correlation of 
the reference R(x 2 , y 2 ) and the image amplitude 0(x 2 , y 2 ) . Thus as the input signal is displaced 
the variation of the light amplitude W( r H , r y ) corresponds to the variation of the correlation function 
with respect to the displacements r x and r y . Note that the correlation function defined by equation 
(123) involves the image amplitude 0(x 2 , y 2 ) which is inverted with respect to the input signal 
S(Xj , y,) . Therefore, if R(x 2 , y 2 ) is not a symmetrical function, it must be oriented correctly 
with respect to the image 0(x 2 , y 2 ) rather than with respect to the input signal S(x, , y t ) . 

We shall briefly consider the implication of the steps from equation ( 122) to equation ( 123). 

This step in our derivation was accomplished by stating that we would consider only the single 
point in the output plane F ;J which lies on the optical axis (i.e. x 3 = y 3 = 0 ). In practice it is 

physically impossible to isolate a single point. The best attempt we can make is to restrict our light 
measurement or detection to a small area about the selected point. The light amplitude at points 
within this area (except for the one point on the optical axis) is given by equation (122) rather than 
( 123). The light amplitude distribution will not be uniform over the finite area of measurement due 
to the phase variation involved in the integral of equation ( 122). For example, if we use a pinhole 
aperture 10 microns in diameter to define our detection area, the phase term in equation ( 122 ) can 
vary as much as 47 r radians (2 cycles) over the range of the image aperture (assuming r : Jmax 
- 0. 14f , f = 10 cm, A. = 5461 x 10' 8 cm). The effects of the phase term in equation (122) is to reduce 
the light amplitude at points off axis since the contributions to the integral are not in phase. There- 
fore, the actual light available through a pinhole aperture located in the F 0 plane at x 3 ~ y 3 = 0 
will be less than that found by assuming the light amplitude given by equation ( 123 ) appears at all 
points within the pinhole aperture. We will not consider this problem any further here since the 
analysis would depend on the type of photo - detector or measurement technique used. We will assume 
that the variations involved are small enough so that any measurement will yield values proportional 
to the square of the amplitude given by equation ( 123). 

As pointed out above the correlation function defined by equation ( 123) involves the image 
amplitude 0(x 2 , y 2 ) rather than the single amplitude S(x x , y x ) . As defined by equation (120) 
the image amplitude contains phase terms not present in the signal. A correlation operation can 
be performed based on the image as given by equation (123); however, the reference signal R(x 2 , y 2 ) 
would have to be selected in terms of the image 0(x 2 , y 2 ) including the phase terms. The cor- 
relation function obtained would correspond to a distorted signal rather than the actual signal 
S( x x , y x ) . The presence of distortion due to the phase terms in equation ( 120), therefore, com- 
plicates the analysis and determination of the correlation process. For example, the image amplitude 
0(x 2 , y 2 ) will be complex (phase variation as well as amplitude) and for complete correlation a 
complex reference signal is required. Such reference transparencies are difficult to produce. The 
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phase distortions are commonly neglected and a reference signal is selected based on an ideal image 
(no distortion) of the input signal. We will now proceed to analyze such a system to determine the 
effects of the undesirable phase terms present in our equations. 

Let us consider a signal which would produce an ideal image amplitude defined by 

°( x 2 + r x ) r 22 B " COS 277 p -' (* 2 + T * ) ( 124 ) 

n 

Equation ( 124) defines a signal image composed of a series of cosine harmonics in one dimension. 

A one - dimensional signal has been chosen to simplify our analysis. Referring to equation ( 120) 
we find that each frequency term in the image has a phase term e ikf(A - p n> 4 associated with it and 
the image also has a phase term </j(x ? , y 2 ) associated with it. Thus for the actual image we would 
have 


0(x 2 +r x ) 



B 


ikf(\p n ) 4 /4 
e cos 


2 ”P„(*2 



(125) 


We can consider a reference signal without phase given by 

r(x 2 ) = 22 R « cos 277 p "> x 2 (i26 ^ 

m 

Tbs reference signal R(x ) defined by equation ( 126) has been selected to have the same cosine 
harmonics (m = n) as the imaged signal being considered. Note that the reference signal defined 
by equation ( 126) does not contain the phase terms present in equation ( 125). The product of ref- 
erence and image for the ideal image of equation ( 124 ) is given by 

R(x 2 ) 0 ( x 2 +T x ) = 22 BnRm COS27TR " X 2 COs2rr Pn K +T x) (127) 

n , m 

The product of reference and image for the actual image of equation ( 125 ) is given by 


R( x 2 )°( x 2 + r x ) = *(*,)£ 
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R e 

m 


ikf''\p n ) 4 /4 


COS 2 TT p X_ COS 
m 2 


^Pn ( X 2 +T x) 


(128). 


n,m 

Substituting equation ( 127 ) and ( 128) into equation ( 123), we obtain for the correlation function of 
the ideal image: 


w 


K) = 21 Bn R * | COS 277 p m X 2 COS 277 p n ( H 2 + T x ) dx 2 

n,m 


(129) 
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and for the correlation functions of the actual image: 


"<'*> L 

n , m 


B s R 

n m 


ikr<\p n ) 4 A 


* 2 ) co S 2rrp m x 2 cos 2n p n (x 2 + r x )dx 3 


(130) 


\ 4 / 

Comparing equations ( 129) and ( 130) we find that the term c‘ Pn affects the phase of each 
( term in the double summation. From our previous discussion of frequency limitations with respect 
to this phase term we can show that applying the limitation developed for imaged intensity limits the 
phase variation of this tern, to approximately 7°. In summing terms which are not in phase the 
result will be less than summing the same terms in amplitude only. Thus the presence of the phase 
term e ,kf(>p n) A h as the effect of reducing the value of W (r x ) in equation ( 130) as compared to 
equation (129). However, since the maximum phase will be about 7° the difference due to this term 
will be small. The phase term <£(x 2 ) appears in the integral of each term in the sum and has the 
same effect on the integral ( can be considered as summation ) as the phase term discussed above 
had on the summation. However, as discussed above the phase variations of <£(x 2 ) ranges over 19 
cycles and the effect on the value of the integral will be correspondingly greater. The actual magnitude 
of the reduction in W(-r x ) due to these phase terms is difficult to evaluate in general since the re- 
duction will depend on the for in of the signals involved. However, from our discussion here it is 
apparent that the actual correlation function observed will be smaller in amplitude than that pre- 
1 dieted using an ideal image. This result is obvious if we consider that the presence of the phase 

I terms in the actual image produce a mismatch between the signal and reference and therefore the 

I correlation will be reduced. The effects of these phase terms can be reduced by further restricting 
! the frequency range p m(iv (or r v ) and signal and image aperture size which would limit the var- 
| iation of the phase terms. We will not proceed with an analysis of the required limitations since 

f the analysis will depend to a large extent on the type of signals involved and the correlation results 

f desired. Here we have developed the equations necessary for such an evaluation and hopefully have 
‘ pointed out the significance of the various effects which appear in an optical system. 


PHASE CORRECTIONS 

In the last few sections we have discussed the effects of undesirable phase terms in optical 
systems and have demonstrated that these effects can be minimized by restricting the size of signal 
apertures and the spectral range of the signals. An alternative approach can be pursued by inserting 
phase corrections into the optical system. Such phase corrections can be implemented by inserting 
sheets or plates of transparent materials whose thickness or index of refraction has variations which 
introduce phase terms opposite in sign to those introduced by the system. 

The basic equation representing the Fourier transform operation of a lens was given by equation 
(45) as 


A(x, y) 


_ ie ikR(*,y, rr 

\( f + d) J J A ( x ! 




*i4*{p* ,+qy i) 


dx. 


d A 


(45) 
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Rewriting this equation retaining only the variable part of the terms outside the integral we obtain 


A(x, y) 7 ( x , y) jJ A ' ( x t , y t ) <■ 12 0’V'i v j) d X) dy 


(131) 


where 


/(x, y) 


iakf [( 1 + r*/2f 2 5 ( t + r 2 f 2 )” 1 2 ] 


assuming <1 


as derived in equation (119). As pointed out in all our discussions the phase term / ( x , y) destroys 
the simple Fourier' transform representation of lens focussing properties since the integral part of 
equation (131) corresponds to a Fourier transform by itself. Let us consider inserting a phase cor- 
rection plate into the back focal plane of a lens with transmission properties given by 


P(x, y) : A o iC ;* (x. y ) 


(132) 


where 


o 
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■/ (X. y) - 


constant 

constant 

-i2kf[(l+r 2 /2f 2 )(J+r 2 /f 2 )' 1/2 l 


The light amplitude distribution appearing at the output side of the plate will be 


A(x, y)P(x, y) = JJa' (x t , yj e l 27 T ( px i +qy i) dXl dy t 


(133) 


where the constant term A 0 e lc has been dropped and ^(x, y) /(x, y) =1. Thus by inserting a 
phase plate with transmission properties given by equation ( 132 ) in the back focal plane of each lens 
in an optical system the phase terms are eliminated. From the definition given by equation ( 132) 
we find that the phase correction depends on the focal length f of the lens and the wavelength \(k = 
of the light. The phase correction is not dependent on the signal used and therefore a phase plate 
can be made for the lens and wavelength to be used in the system. Of course, the correction of phase 
by this method requires an accurate technique for producing the phase plate and positioning the plate 
in the optical system. In any case, we have shown that the elimination of undesirable phase terms 
is possible at least in theory. Any inaccuracies in production or location of the phase plate may be 
acceptable as long as the phase terms are appreciably less than before the plate was introduced. 
Assuming the phase plate is an accurate representation of the transmission function of equation 
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(132), the Fourier transform relation of equation (133) will be valid. With the relationship given ^ 
by equation (133) the operation of spectrum analyzer, imaging and optical correlation systems can 
be described by the ideal cases used in the respective discussions and no undesirable phase terms 
appear in the equations. 

PHASE TERMS WHEN d / f 

In ovr consideration of phase terms we considered the special case of an input plane coincident 
with the front focal plane of a lens (d : f). This special case was chosen to eliminate the phase 
effects of a term proportional to (f - cl). From equation (118) we can write a complete expression 
for the variable part of the exponential term e ikR as 


, i.k((l-0( l+r 2 /f 2 r l/2 i2kf(l + r 2 /2f 2 )(H-r 2 /f 2 )* ,/2 

'/<*, y) " e C 


(134) 


Equation ( 134) reduces to the form of equation (119) when d f . If we can restrict our consider- 
ation to a rather limited range in the back focal plane of a lens, we have shown that equation ( 134) 
can be given to a good approximation in the form of equation (88) 


</<X. y) “ e ik(N)(r/f) 2 / 2 e i,"r/C) 4 /4 (135) 

We can consider equation (135) as a representation of the phase in a back focal plane containing a 
frequency spectrum while equation ( 134) is a more accurate representation which applies in a back 
focal plane containing an image of the input signal. This application of equation ( 134) and ( 135) is 
based on the relatively larger apertures commonly used in the signal and image planes. 

In the systems which we have considered here the complete phase variations as given by equation 
( 134) appear only in the correlator system. This can be seen by noting the presence of 4{* 2 , y 2 ) 
in equation (120). Since this phase factor is expressed in terms of the coordinates x 2 and y 2 of 
the image phase we cannot use a very restrictive aperture limitation without severely effecting our 
signal handling capability. Therefore, the approximation of equation ( 135) will not be valid and 
0(x 2 , y 2 ) in equation (120) will have the form of equation (134) with d 2 and r 2 replacing d and r 
respectively 


0( x 2 , y 2 ) 


e ik(d 2 -f )(i+r 2 2 /f2)“ 1/2 e l2kf(l + r 2 /2f 2 )( 1+r 2 /f 2 )" 1/2 


(136) 


where r 2 2 = x 2 + y 2 2 and d 2 is the distance from the spectrum plane f' to the lens l 2 (see Figure 
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16). In the sample correlation function of equation (130) we can see that the additional phase term 
dependent on (d 2 - f> will increase the effect of ,'(x 2 ) on the integrals. In practice, a system would 
be specified on the basis of locating lens l 2 so that <l 2 f. However, the exact positioning of the 
lenses in an optical system is obviously a practical impossibility. Thus the additional phase term 
containing (4 2 - f) represents the phase distortion introduced by inaccuracies in the implementation 
of the system. Since the quantity (<l 2 - f) represents an inaccuracy its value will usually be undeter- 
mined. Therefore, the first term in equation ( 136) represents an undetermined phase error in the 
optical correlator system. If a guess or estimate of the tolerances in the system can be made, this 
error term can be used to determine the maximum distortion of the correlation function by analysis 
similar to that implied by equation (130). 

Since the variation of the phase term containing (d 2 - f) in equation (136) is not known specif- 
ically, the elimination of this term by a phase correction plate is not possible. Thus in a system 
containing phase correction plates only the second term of equation (136) can be eliminated. In 
such systems /(x 2 , y 2 ) is completely given by the position error term 



) 


lk(d 2 “f )(l + r 2 2 / f 2 ) 


1/2 


(137) 


The distortion of the correlation function in a phase corrected system is therefore completely depen- 
dent upon the positioning errors. Again referring to the sample of equation (130), /<x 2 ) would be 
given in the form of equation (137). The phase term in front of the integral of equation (130) would 
also be replaced by an error term from an expression such as equation (135) as will be discussed 
below . 


The phase term given by equation ( 135) represents the variable part of equation (88). The ex* 
ponential dependent on (f - dj) represents an error term due to lens positioning. To account for 
this error the complete phase approximation of eq ation ( 135) must be used in place of the K t expo- 
nential of equation (89). Thus the error phase term will appear throughout our previous analysis 
wherever we have used the K 1 term. 

We considered the effect of the K, phare term on the image intensity and on the correlation 
function in earlier sections. In our correlator discussion the variable phase term of K t appears 
in the integral used to define the image amplitude distribution in equation (120). To account for 
errors in placement of lens (see Figure 16) the exponential e ikf < r/f > 4 /4 in equation ( 120) must be 
replaced by a phase term of the form of equation ( 135 ) which can be written 


<K*> y) 


ik(f-d.)(r/f) 2 /2 

e A 


e ik((r/f) 4 A 


(138) 


Referring to our sample correlation of equation ( 130) the phase term given by equation ( 138) will 
replace the e ikf < r/f > 4 /'*term in front of the integral. The error phase term has the affect of adding 
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an additional variation to the phase of the terms in the summation. For a phase corrected system 
the i.fca»/fr/4 term is eliminated and the undesirable phase difference of terms in the summation 
will be dependent only on the accuracy of the system implementation. 

In our discussion of imaging systems we defined a factor m by equation ( 103) and developed a 
method for determining the accuracy of the image intensity based on this parameter. To extend this 
method to include the case for d, '/■ f we merely redefine m by the equation 

- * $(!)’ [('-,) 1 o»> 

which is obtained from the exponents of the terms in equation (138), For d, less than 1' the limits 
on m defined in our previous discussion will apply to equation (;« 39) for the maximum value of r . 

It is noted that since (f - d ) is a positive quantity when d is less than f the required limit on 
-p will be less than that determined for the case - f . When d t is greater than f , (i ~d ) 
is a negative quantity which would imply that the value of -p can be greater than that for the 
case d = f . This is true except for cases in which d is sufficiently greater than f so tlv’t for 

1 r r 1 

some value of j less than -p the value of equation ( 139) is greater in absolute value than for 
-p . That is, since (f - d. ) is negative the right side of equation (139) is zero at r 0 , be- 
comes negative as r increases until it reaches a maximum negative value and then increases to 
positive values. Depending on the value ofd and the limit -p it is possible that the phase at 
the maximum negative value is greater than that at the aperture limit ~p . In such cases, the 
maximum negative value must be considered rather than the end value at -“f-* and the aperture may 
have to be restricted to values below this maximum. In any case the brackets on the right side of 
equation ( 139) must be considered as an absolute value symbol when the quantity within is negative 
so that m will have real values. In other words, we are concerned with the magnitude of the phase 
variations and not the sign. 

For systems with phase correction only the first term of equation ( 138) will remain and ( 139) 
will be simplified to 


">* = i (t ) 2 ( f “ d i) d«) 

Equation ( 140) can then be used with the image intensity criteria developed earlier to consider the 
effects of positioning error for lens Lj in phase corrected systems. 

In most of the literature the phase corrected form of ( 138) 


y) = 


ik(f”dj)( r/f) 2 


(141) 


is used even though phase correction techniques may not be employed. This application of ( 141) 


requires that the frequency limitation be sufficient so that the e ikf ( r/f >%term can be neglected. This 
application also implies that the term (f - d t ) is much greater than the maximum value of (jV . 

If this condition does not hold the neglected term will contribute a phase comparable to that of equa- 
tion (141) which would then be in error. Conversely, if (f - d,) is not greater than | (jj* and 
the e ikf < r/f)<l / 4 term is considered negligible, than the term given by equation (141) is also negligible 
since it is comparable to the neglected term. 

In this section we have outlined the procedure for taking into account the additional phase term 
arising from inaccuracies in the positioning of lenses. It was pointed out that since these terms are 
due to inaccuracies they are generally not specified completely. The worse case, however, can be 
specified by estimating the maximum error in the position of a lens. From this extreme estimate the 
necessary aperture limitation or the evaluation of errors in the desired optical outputs can be deter- 
mined for a worse case analysis. Unfortunately, due to the undetermined nature of these terms, 
phase correction cannot be used to eliminate their effects. 


SUMMARY 


The derivation presented in this report demonstrates that the Fourier trans- 
form representation, of a focussed diff raction pattern is a reasonable approximation 
for describing the operation of coherent optical systems with lenses. The basic 
assumptions consistedof the ideal focal properties of a lens arid the use of perfectly 
coherent light. Except for undesirable phase effects, it was demonstrated that 
the Fourier transform representation is obtained as a good approximation by im- 
posing limitations on the size and frequency content of signals allowed. The phase 
terms can also be eliminated by aperture limitations; however, the restrictions 
are more severe. Depending on the application, a trade off must be made between 
the limitations required for elimination of undesired terms and the desired signal 
size and frequency content. 

Techniques for evaluating the effects of the various approximations and for 
analyzing the operation of ideal optical systems have been presented. For speci- 
fied signals and applications these expressions can be used to determine the theo- 
retical errors in assuming ideal operations as is commonly practiced. The anal- 
ysis presented is by no means complete; however, itis hoped thatit is sufficiently 
detailed to provide a clear insight into the required approximations. 

This report represents an initial step in the development of a detailed analy- 
sis of the capabilities of optical processing systems. Further studies are required 
to formulate complete criteria and analysis techniques for practical optical sys- 
tems. Some of the important areas whicV must be considered include; 

1. Lens aberrations 

2. Coherence 

3. Transmission properties of modulation media 

4. Band - limited signal approximations 

These areas were not treated in the analysis presented here since the initial study 
was restricted to ideal systems. The complexity of the mathematical formulation 
of optical patterns can be simplified somewhat by using the notation of communica- 
tion theory 5 . Such methods are becoming quite useful in modern optics studies. 
The development ef these techniques provides a means for avoiding the complicated 
mathematical formulations inherent in diffraction problems. However, any new 
formulations such as these must be considered in terms of the more rigorous for- 
mulation since the various approximations are basically the same in both formu- 
lations. 
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APPENDIX 

DERIVATION OF DIFFRACTION FORMULA 
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Figure A-l— Diagram of diffraction configuration. 

We will restrict our discussion to the somewhat special case of diffraction at an infinite plane 

surface as diagrammed in Figure A-l. The shaded area in the figure represents a cross - section 

of an infinite slab. The basic problem is to determine the electric field at any point P in the dif- 

# 

fraction region which includes all points to the right of the plane boundary surface indicated in 
Figure A-l. When the plane slab is not present, the electric field at any point P could be found 
simply by substituting the coordinates of P into the mathematical expression describing the light 
propagating from whatever light source may be present. In itself, finding a mathematical repre- 
sentation for a given light source is not a simple problem. The light radiated by a source is depen- 
dent upon the mechanism generating the light as well as the geometry of the source. In many cases 
it is assumed that a good approximation is obtained by considering ideal light sources which radiate 
spherical waves (point sources) or plane waves (point sources at an infinite distance). 

Inserting a plane surface into the path of the light waves as shown in Figure A-l complicates 
our problem. Since the presence of the plane effects the propagation characteristics in space, the 
electric field at any point will now depend on the characteristics of both the light source and the plane. 
The characteristics of the plane depend on the type of material of which it is made and these 
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characteristics usually vary from point to point in the surface. Thus we are confronted with the 
problem of determining the electric field in the presence of a surface which can have widely varying 
electrical properties from point to point. As the reader may already know, problems of this nature 
are very difficult and, in fact, very few diffraction problems have been solved rigorously. Fortunately, 
in many cases of practical interest results within experimental accuracy can be obtained by less 
rigorous techniques. 


In order to implement a discussion of diffraction problems, we will now proceed to the deriva- 
tion of a formula for diffraction at a plane surface. This result was first derived by Sommerfeld 6 
in 1896 and as will be demonstrated is effectively a mathematical representation of Huygen’s prin- 
ciple for the special case of a plane diffraction surface. The basic assumption we will start with 
is that the components of the electric field are known at every point on the right hand boundary sur- 
face of the plane slab (refer to Figure A - 1). The methods for determining these field values are 
of . o importance at this point; however, in many cases of interest assuming a multiplying factor 
representing the transmission properties of a thin material provides results in close agreement with 
experiment. For our present purpose, we will simply assume that the value of the electric field at 
every point on the plane boundary surface is known (i.e. can be found easily). 

PLANE BOUNDARY SURFACE 
(ELECTRIC FIELD E KNOWN AT EVERY 
POINT ON SURFACE) 

DIFFRACTION REGION 

. P 

SOURCE I 
REGION 



Figure A-2— Outline of diffraction problem. 


Referring to Figure A-2, the problem we must solve can be stated as follows: 

Given the electric field at every point on an infinite plane boundary, what is the electric 
field at any point P in the diffraction region ? 


I 
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As shown in Figure A -2, we define our diffraction region to be all space on the right side of 
the boundary surface (note that this region does ,ot contain any light sources). In Figure A - 2, 
it is assumed that all light sources are to the left of the boundary surface and that the diffraction 
region includes all points to the right of the boundary. We can assume that our diffraction region 
is in free space (vv city of light is e in all directions) and specify that there are no electric cur- 
rents or charges present in this region. Since the electric field is a vector quantity, its direction 
at any point is as important as its magnitude. As in the case of any vector we can consider the 
electric field in terms of its components in the x, y and z directions. To simplify our discussion 
we will assume that light waves are monochromatic, or vary in time at a single frequency. When 
necessary this discussion can be extended to the general case of non - monochromatic waves by con- 
sidering each separate frequency component as described here and summing up all components. 

Each component E (in x, y, z components) of the electric field of a monochromatic wave will 
satisfy the Helmhotz equation (time - independent wave equation) at every point P in free space which 
contains no electrical sources: 


( V 2 4 k 2 ) E 0 


(1) 


where 

- x, y, or z '* >mponent of the electric field 

ci 2 ^ <9 2 ^ () 2 

ax 2 dy 2 c?z 2 

c " X' 

- an gular f r e qu eney 
” speed r r light 

" wavelength of light 


plane boundary surface and the fact that E must satisfy equation ( 1 ) 
at every point P in our diffraction region, we can derive a formula for the electr ic field at P in 
terms of the values given on the surface. 

We will introduce an arbitrary function V which also satisfies Helmholtz equation: 

(V 2 +k 2 ')V ~ 0 (2) 

There are many functions which will satisfy equation (2); however, we will presently continue to 
use the symbol V and reserve the selection of a specific function until we determine a few additional 
characteristics of V which will allow us to accomplish our derivation. In terms of E and V , we 
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Using the values for E on the 
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can define two vectors F, and f 2 as 


F, E V V 


f 2 VVE 


where 


. . -t* d . .* <> , ,* (1 

V i Tx ' y Ty + k In 

and W and V’E denote the gradient of V and E respectively. We now introduce Gauss's theorem; 


V * F civ 


volume 


If * ■ « 


svj r fact* 


where the volume integral on the left can be taken over any volume which does not contain discon- 
tinuities of the divergence of F (V * f) and the surface integral on the right is over the surface which 
encloses the volume (F must be continuous on the surface so that the surface integral can be found). 
From (3) and (4) we obtain 


V * F. 


VE • VV + E V 2 V 


V y . V E + V V 2 E 


From above we note that (6) and (7) must not have discontinuities within the volume of integration; 
therefore, E and V must have continuous first and second derivatives. Since we are considering a 
diffraction region free of electrical current and charge, E will meet this requirement for any volume 
in the diffraction region. Since we have not yet selected a specific V , we will note this requirement 
and be sure to satisfy it when selecting our V. Thus we can write equation (5) substituting (3), 

(4), (6) and (7): 


JjJv-Fjdv " JJJ{VE • VV + EV 2 v}dv - JJe' 
JJJv * dv « jJJ {W • VE+ W 2 e} dv - JJV 


E V V • ds 


VE • ds 


We can substract (9) from ( 8) and noting that ve*W-W-ve - owe obtain: 


JjJ{EV 2 V- W 2 E}dv = JJcEW-WE} • ds 
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This is Green’s theorem and holds for any function E and V which have continuous first and sec- 
ond derivations in the region of integration. From equations (1) and (2) we know that V 2 E ~k 2 E 
and V 2 v -k 2 v and therefore the bracket on the left hand side of equation ( 10) gives 

E- 2 V ■ V 2 E - k 2 EV ( k 2 EV ) 0 

Since the integrand is zero as given in (11) the volume integral on the left hand side of (10) is zero 
and we can rewrite equation (10) as 


ff 


{ E V - WE } • ds 


( 12 ) 



Figure A-3— Boundary surface enclosing all points except P. 

The surface integral in equation (12) is to be taken over any closed surface which does not 
enclose discontinuous points. For our purposes we will choose the surface as indicated in Figure 
A-3. The outer surface consists of the infinite boundary plane (A) on which the electric field is 
known and a hemisphere (C) of infinite radius which connects the ends of the plane at infinity. We 
define an inner surface ( 2 ) as a sphere centered at P with radius e . Taking A, c, and 2 as our 
closed surface, we have defined the volume between the sphere 2 and the outer surface A-c. If 
we take the limit as e goes to zero, the point p will be the only point in the diffraction region out- 
side the surface of integration. Thus the sphere 2 isolates the point P where we want to find the 
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electric field. The surface integral of equation ( 12) can be written as the sum of the integrals 
ovp*' A , c, and : 


JJ*EW-W/E} * ds ♦ (Tie W ~ Wfcl • (Is « |J'E V VvE' • tis‘ 


( 13 ) 


The surface integral over the hemisphere at infinity can be eliminated through a physical 
argument given by Born and Wolf 4 . In practice a light wave starts at some time and since it prop- 
agates at a finite velocity ( e in free space ) must have an end. We can imagine the infinite hemi- 
sphere continually expanding in front of the light waves. In this way, the contribution of the wave 
on the hemisphere is zero since the light waves never reach the hemisphere. The integral over the 
surface c will therefore be zero and equation (13) can be written 


JJ 


<1 EW - WE} 


ds 



IEVV - WE} 


ds 0 


( 14 ) 


We will now take advantage of our freedom to select a function V in order to simplify equation (14). 
In the integral over the plane surface A we note that the term WE requires the values of 'E on 
the boundary surface. Since we do not know VE on A, we will require V to be zero on the boundary- 
surface to eliminate this term. Equation ( 14) can then be written 




I EW - WE} * ds 




o 


To consider the integral over the sphere we will express the surface element ds 
coordinates: 


( 15 ) 


in polar 


ds “ ** e 2 sin 0 6.0 d$ ? 


where r is a unit vector in a radial direction away from the center at P and fhe minus sign is re- 
quired by the convention that a surface normal is directed away from the enclosed volume. The 
integral over ?■ can be written as 




(" sin [ d^t 2 (EW- WE) • r 
*o •'o 


By vector identity (EW - WE) • r - E jp - V 57 and ( 16) can be written 


- f sin 0 f #[e (£ 2 j^)-e 2 V ||] 
•'0^0 


(16) 


(17) 
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Recall that we want to take the limit as < goes to zero. The term > 2 Vjp in (17) can be eliminated 
if we require V to satisfy the condition 


1 i m ' 2 V 0 
< o 


(18) 


at any point P in the diffraction region. 

(9 K 

Since E has continuous first derivations 77 will be finite and the conditioning given by (18) 
will give 


1 im 
£**0 


<9E 

dr 


0 


Expression ( 17 ) is then given as 


j hint) dll E^e 2 ^7)^ 


(19) 


( 20 ) 


Since E is the value of the field on the surface X and the surface £ reduces to the point P when e 
goes to zero, the limit in (20) can be written 

dV dV 

L e '^ Ee2 d~r S E ( P > Ji™ e2 Tr (21) 


where E(P) is the electric field at P . We will require that V satisfy the condition 


1 im t z 1 

e-*o p 


( 22 ) 


at any point P in the diffraction region. The limit in the brai ’cets of equation ( 20 ) is then simply 
E(P) and (20) gives 


fit 

sin 6 dd #E(P) = - Am E(P) 

•'O Jo 

Substituting the result of ( 23 ) into equation ( 16 ) for the surface integral over 2 we get 


(23) 


II 


EW • ds - 4 tt E(P) = 0 


(24) 


Rearranging terms equation (24) can be written 


1 


E(P) = Tfir JJ EW • ds 


II' 


(25) 
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c? V () 

By vector identity W • ds ds where ^ represents a partial derivative with respect to a co- 
ordinate perpendicular to the plane boundary surface in a direction out from the enclosed region. 
Equation (25) can be written without vectors as 


E(P) 



( 26 ) 


Except for the selection of a function v which satisfies all the conditions we have used, equation 
(26) has the form we require. The left side i3 just the field at a point P, and since the integral on 
the right side is on the plane boundary surface A the E in the integrand assumes the given values 
on the surface. Thus the field at any point P is given in terms of the given values on the surface 
A by formula (26). 


In deriving equation (26) we have imposed restrictions which must be satisfied by the function V. 
Collecting the requirements to be satisfied by v we have 

X. v must have continuous first and second derivations within the region inside the 
boundaries shown in Figure A -3. 

2. (7 2 + k 2 ) V 0 

3. V " 0 on boundary surface 

4. w t ' 0 on boundary surface 

5. lim e 2 V = 0 at any point P in diffraction region 

6. Um e 2 = i at any point P in diffraction region 

e-0 0 r 

Fortunately there is a function which meets all these requirements: 


V = 



( 27 ) 


where r and r ' are defined by Figure A - 4 as 


r ~ distance from P to any point Q 
r' ~ distance from P' to any point Q 


P' = 


k - 


mirror image of P 

(i.e., PP' is perpendicular to the boundary A and d = d' ) 

oj _ 2n 

c ~ k 




That our function V given by (22 ) does in fact satisfy all requirements can be proved by direct 
substitution into the expressions listed above. Here we will only discuss the continuity requirements 
(item 1 above). As defined by equation (27) v has discontinuities at r = o and at r' = o. These 
discontinuities appear at the points P and P' respectively andP' lies outside the diffraction region 
and P was separated out of the integration region by our sphere 2 . Thus the only points at which 
discontinuities appear are outside the region specified in the continuity requirement and v given by 
(27) does satisfy this requirement. 


Returning to equation (26) we note that on the surface A is required rather than v itself. 
We can select coordinates so that the z axis is perpendicular to surface A as shown in Figure A - 5 
so that becomes - jy ( minus sign appears since the positive z direction is opposite to the pos- 
itive n direction). From the geometry of Figure A - 5, r and r' are given as 


r - £x 2 + y 2 + (d — z ) 2 j 1/ 2 
r' = [x 2 +y 2 + (d + z) 2 ]l/2 


(28) 

(29) 
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Figure A-5— Geometry for definition of terms in V. 


Substituting (28) and (29) into (27) we can find 

dV <?V 3 f e ik[x 2 +y 2 +(d+z) 2 3 1/2 e i|c [« 2 +y 2 +(d-*) 2 ] ,/2 '| 

^ dz (Jx 2 + y 2 +(d+ z ) 2 ]*' 2 ” [x 2 +y 2 +(d-z) 2 ] 1 ' /2 J 


■t 


-ik(d + z 1 ) 


d + z 


2k £■ + ~ 1 

x 2 +y 2 + (d+ z) 2 ] [x 2 +y 2 +( d +z) 2 ] 3/2 J 


3 ik tx 2 +y 2 +(d+z ) 2 ] 1/2 


L - ik ( d --z) 

! f __o i o i j \ v ol 


d -z 


I f X 2 + v 2 + (d ~ z^ 2 1 fx 2 + v 2 + { (\ - sr N 2 1 3/2 


j 


i k Cx ^+y " + ( d~z ) 




Now in equation (26) we are integrating over the surface A so that we want the value of on 

( t z 

A which is obtained by setting z ~ o in equation (30) 



<9V 


_ il 


_ 2 q Ik lx 2 +y 2 +d 2 3 1/2 

n, i 

* 

7) n 

A 

d z 

O 

11 

N 

[x 2 +y^ + d 2 ]*/ 2 

£x 2 + y 2 + d 2 j __ [x 2 ^ y 2 + d 2 ] 1//2 . 


(31) 


Equation (31) can be simplified by noting from equation (28) 

r “ [x 2 + y 2 +d 2 ]*' /2 when 


cos 0 


d 

£ x 2 + y 2 + d 2 ] 1/2 


and the geometry of Figure A - 5* 
z ~ 0 (32) 

(33) 


Substituting (32) and (33) into the equation (31) we obtain 


dV 

dn 


d _ V 

dz 


-2e 


ik r 


cos 6 i 


z~0 


[ik- 7 ] 


Now we can substitute (34) into (26) and complete our diffraction formula 


E(P) = co s «[ik-i] 


ds 


where ( refer to Figure A- 5) 


> 


E(P) 


E a 


e 

k 


Electric field at a point P in the diffraction region. 
~ electric field on the plane boundary A. 

= distance from P to a point on A. 

= angle between r and normal to plane A. 

co _ 2rr 
c “ k 


(34) 


(35) 


•Note that in Figure A-5, P and P v were chosen as point on the z axis to obtain equation (32) and (33)- In gene ral x would be replaced 
by (x ~ Xq) and y would be replaced by (y “ yQ) where Xq t y q define the x, y coordinates of the points P and P In the derivation 

the z coordinate of the point P was represented by d to avoid confusion with the coordinates of the point Q. In general the d in equa- 
tions (32) and (33) is replaced by z. Otherwise the general results have the same form as found above. 
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Given the electric field E at every point on the surface A, equation (35) can be used to determine 
the electric field E(P) at any point p in the diffraction region. This statement dept ids, of course, 
on whether the integration indicated in equation ( 35 ) can be performed. If the integration cannot 
be performed analytically, it can be assumed that a numerical solution to any desired accuracy can 
be obtained using a computer. In many problems of interest, satisfactory results can be obtained 
by approximating equation (35) using the geometry of the specific problem. For example, for 
small angles 6 such that cos & - 1 and at great distances r such that 1/ r «k , we can approximate 
equation (35) as 



(36) 


Equation (36) represents Huygens' principle since the contribution from each point on the boundary 
surface is given by E A e ikf /r which describes the spatial variation of a spherical wave. Thus the 
diffracted field as given by equation (36) can be interpreted as the summation (integral) of spherical 
waves radiating from each point on the diffraction boundary. 







